COMPOUNDS THAT INHIBIT INTERACTION BETWEEN SIGNAL-TRANSDUCING PROTEINS
AND THE GLGF (PDZ/DHR) DOMAIN AND USES THEREOF

The invention disclosed herein was made with Government 
support under Grant No. R01GM55147-01 from the National 
Institutes of Health of the United States Department of
Health and Human Services. Accordingly, the U.S. 
Government has certain rights in this invention. 



BACKGROUND 



Throughout this application, various publications are
referenced by author and date. Full citations for these
publications may be found listed alphabetically at the
end of the specification immediately preceding the Sequence
Listing and the claims. The disclosures of these
publications in their entireties are hereby incorporated
by reference into this application in order to more fully
describe the state of the art as known to those skilled
therein as of the date of the invention described and
claimed herein.
Fas (APO-1/CD95) and its ligand have been identified as
important signal mediators of apoptosis (Itoh, et al.
1991). The structural organization of Fas (APO-1/CD95) has
suggested that it is a member of the tumor necrosis
factor receptor superfamily, which also includes the p75
nerve growth factor receptor (NGFR) (Johnson, et al.
1986), the T-cell-activation marker CD27 (Camerini, et
al. 1991), the Hodgkin-lymphoma-associated antigen CD30
(Smith, et al. 1993), the human B cell antigen CD40
(Stamenkovic, et al. 1989), and the T cell antigen OX40
(Mallett, et al. 1990). Genetic mutations of both Fas
and its ligand have been associated with
lymphoproliferative and autoimmune disorders in mice
(Watanabe-Fukunaga, et al. 1992; Takahashi, et al. 1994).
Furthermore, alterations of Fas expression level have
been thought to lead to the induction of apoptosis in
T-cells infected with human immunodeficiency virus (HIV)
(Westendorp, et al. 1995).

Several Fas-interacting signal transducing molecules,
such as Fas-associated phosphatase-1 (FAP-1) (Figure 1)
(Sato, et al. 1995), FADD/MORT1/CAP-1/CAP-2 (Chinnaiyan,
et al. 1995; Boldin, et al. 1995; Kischkel, et al. 1995)
and RIP (Stanger, et al. 1995), have been identified
using yeast two-hybrid and biochemical approaches. All
but FAP-1 associate with the functional cell death domain
of Fas, and overexpression of FADD/MORT1 or RIP induces
apoptosis in cells transfected with these proteins. In
contrast, FAP-1 is the only protein that associates with
the negative regulatory domain (C-terminal 15 amino
acids) (Ito, et al. 1993) of Fas and that inhibits
Fas-induced apoptosis.

FAP-1 (PTPN13) has several alternatively-spliced forms
that are identical to PTP-BAS/hPTP1E/PTPL1 (Maekawa, et
al. 1994; Banville, et al. 1994; Saras, et al. 1994) and
contains a membrane-binding region similar to those found
in the cytoskeleton-associated proteins ezrin (Gould, et
al. 1989), radixin (Funayama, et al. 1991), moesin (Lankes,
et al. 1991), neurofibromatosis type II gene product
(NFII) (Rouleau, et al. 1993), and protein 4.1 (Conboy,
et al. 1991), as well as in the PTPases PTPH1 (Yang, et
al. 1991), PTP-MEG (Gu, et al. 1991), and PTPD1 (Vogel,
et al. 1993). FAP-1 intriguingly contains six GLGF
(PDZ/DHR) repeats that are thought to mediate intra- and
inter-molecular interactions among protein domains. The
third GLGF repeat of FAP-1 was first identified as a
domain showing specific interaction with the
C-terminus of the Fas receptor (Sato, et al. 1995). This
suggests that the GLGF domain may play an important role
in targeting proteins to the submembranous cytoskeleton
and/or in regulating biochemical activity. GLGF repeats
have been previously found in guanylate kinases, as well
as in the rat post-synaptic density protein (PSD-95) (Cho,
et al. 1992), which is a homolog of the Drosophila tumor
suppressor protein, lethal-(1)-disc-large-1 [dlg-1]
(Woods, et al. 1991; Kitamura, et al. 1994). These
repeats may mediate homo- and hetero-dimerization, which
could potentially influence PTPase activity, binding to
Fas, and/or interactions of FAP-1 with other signal
transduction proteins. Recently, it has also been
reported that the different PDZ domains of proteins
interact with the C-terminus of ion channels and other
proteins (Figure 1) (TABLE 1) (Kornau, et al. 1995; Kim,
et al. 1995; Matsumine, et al. 1996).




[TABLE 1, partially recovered entries: NR2 subunit;
Shaker-type K+ channel; TDV; PSD95 & DLG]

SUMMARY OF THE INVENTION

This invention provides a composition capable of
inhibiting specific binding between a signal-transducing
protein and a cytoplasmic protein containing the amino
acid sequence (G/S/A/E)-L-G-(F/I/L) (Sequence I.D. No.:
1). Further, the cytoplasmic protein may contain the
amino acid sequence (K/R/Q)-Xn-(G/S/A/E)-L-G-(F/I/L)
(Sequence I.D. No.: 2), wherein X represents any amino
acid which is selected from the group comprising the
twenty naturally occurring amino acids and n represents
at least 2, but not more than 4. In a preferred
embodiment, the amino acid sequence is SLGI (Sequence
I.D. No.: 3). Further, the invention provides for a
composition when the signal-transducing protein has at
its carboxyl terminus the amino acid sequence (S/T)-X-
(V/I/L) (Sequence I.D. No.: 4), wherein each - represents
a peptide bond, each parenthesis encloses amino acids
which are alternatives to one another, each slash within
such parentheses separates the alternative amino acids,
and the X represents any amino acid which is selected
from the group comprising the twenty naturally occurring
amino acids.

This invention also provides for a method of identifying
a compound capable of inhibiting specific binding between
a signal-transducing protein and a cytoplasmic protein
containing the amino acid sequence (G/S/A/E)-L-G-(F/I/L).
Further, this invention provides for a method of
identifying a compound capable of inhibiting specific
binding between a signal-transducing protein having at
its carboxyl terminus the amino acid sequence (S/T)-X-
(V/I/L) and a cytoplasmic protein.

This invention also provides for a method of inhibiting the
proliferation of cancer cells, specifically, where the
cancer cells are derived from organs comprising the
colon, liver, breast, ovary, testis, lung, stomach,
spleen, kidney, prostate, uterus, skin, head, thymus and
neck, or the cells are derived from either T-cells or
B-cells.

This invention also provides for a method of treating
cancer in a subject with an amount of the composition
effective to result in apoptosis of the cells,
specifically, where the cancer cells are derived from
organs comprising the thymus, colon, liver, breast,
ovary, testis, lung, stomach, spleen, kidney, prostate,
uterus, skin, head and neck, or the cells are derived
from either T-cells or B-cells.

This invention also provides for a method of inhibiting
the proliferation of virally infected cells, specifically
wherein the virally infected cells are infected with the
Hepatitis B virus, Epstein-Barr virus, influenza virus,
Papilloma virus, Adenovirus, Human T-cell lymphotropic
virus type 1, or HIV.

This invention also provides a pharmaceutical composition
comprising compositions capable of inhibiting specific
binding between a signal-transducing protein and a
cytoplasmic protein.

This invention also provides a pharmaceutical composition
comprising compounds identified to be capable of
inhibiting specific binding between a signal-transducing
protein and a cytoplasmic protein.

BRIEF DESCRIPTION OF THE FIGURES



Figure 1. Diagram of Fas-associated phosphatase-1
protein, showing the six GLGF (PDZ/DHR) domain repeats;
comparison of similar membrane binding sites with other
proteins and proteins that contain GLGF (PDZ/DHR)
repeats.
Figures 2A, 2B, 2C and 2D. Mapping of the minimal region
of the C-terminal of Fas required for the binding to
FAP-1. Numbers at right show each independent clone
(Figures 2C and 2D).

2A. Strategy for screening of a random peptide library
by the yeast two-hybrid system.
2B. Alignment of the C-terminal 15 amino acids of Fas
between human (Sequence I.D. No.: 5), rat (Sequence
I.D. No.: 6), and mouse (Sequence I.D. No.: 7).
2C. The results of screening a semi-random peptide
library. Top row indicates the amino acids which
were fixed based on the homology between human and
rat. Dash lines show unchanged amino acids.
2D. The results of screening a random peptide library
(Sequence I.D. No.: 8, Sequence I.D. No.: 9,
Sequence I.D. No.: 10, Sequence I.D. No.: 11,
Sequence I.D. No.: 12, Sequence I.D. No.: 13,
Sequence I.D. No.: 14, Sequence I.D. No.: 15,
Sequence I.D. No.: 16, Sequence I.D. No.: 17,
respectively).



Figures 3A, 3B and 3C. Inhibition assay of Fas/FAP-1
binding in vitro.

3A. Inhibition assay of Fas/FAP-1 binding using the
C-terminal 15 amino acids of Fas. GST-Fas fusion
protein (191-355) was used for the in vitro binding
assay (lanes 1, 3-10). GST-Fas fusion protein
(191-320) (lane 2) and 1 mM human PAMP (N-terminal
20 amino acids of proadrenomedullin, M.W. 2460.9)
(lane 3) were used as negative controls. The
concentrations of the C-terminal 15 amino acids
added were 1 (lane 4), 3 (lane 5), 10 (lane 6), 30
(lane 7), 100 (lane 8), 300 (lane 9), and 1000 nM
(lane 10).
3B. Inhibition assay of Fas/FAP-1 binding using the
truncated peptides corresponding to the C-terminal
15 amino acids of Fas. All synthetic peptides were
acetylated for this inhibition assay (Sequence I.D.
No.: 4, Sequence I.D. No.: 18, Sequence I.D. No.:
19, Sequence I.D. No.: 20, Sequence I.D. No.: 21,
Sequence I.D. No.: 22, Sequence I.D. No.: 23,
respectively).
3C. Inhibitory effect on Fas/FAP-1 binding using the
scanned tripeptides.

Figures 4A, 4B, 4C and 4D.

4A. Interaction of the C-terminal 3 amino acids of Fas
with FAP-1 in yeast.
4B. Interaction of the C-terminal 3 amino acids of Fas
with FAP-1 in vitro.
4C. Immuno-precipitation of native Fas with GST-FAP-1.
4D. Inhibition of Fas/FAP-1 binding with Ac-SLV or
Ac-SLY.

Figures 5A, 5B, 5C, 5D, 5E and 5F. Microinjection of
Ac-SLV into the DLD-1 cell line. Triangles identify the
cells that were both microinjected with Ac-SLV and
showed condensed chromatin. On the other hand, only one
cell of the area appeared apoptotic when microinjected
with Ac-SLY.

5A. Representative examples of the cells microinjected
with Ac-SLV in the presence of 500 ng/ml CH11 are
shown in phase contrast.
5B. Representative examples of the cells microinjected
with Ac-SLY in the presence of 500 ng/ml CH11 are
shown in phase contrast.
5C. Representative examples of the cells microinjected
with Ac-SLV in the presence of 500 ng/ml CH11 are
shown stained with FITC.
5D. Representative examples of the cells microinjected
with Ac-SLY in the presence of 500 ng/ml CH11 are
shown stained with FITC.
5E. Representative examples of the cells microinjected
with Ac-SLV in the presence of 500 ng/ml CH11 are
shown with fluorescent DNA staining with Hoechst
33342.
5F. Representative examples of the cells microinjected
with Ac-SLY in the presence of 500 ng/ml CH11 are
shown with fluorescent DNA staining with Hoechst
33342.

Figure 6. Quantitation of apoptosis in microinjected
DLD-1 cells.

Figures 7A, 7B, 7C, 7D, 7E, 7F, 7G, and 7H.

7A. Amino acid sequence of human nerve growth factor
receptor (Sequence I.D. No.: 24).
7B. Amino acid sequence of human CD4 receptor (Sequence
I.D. No.: 25).
7C. The interaction of Fas-associated phosphatase-1 with
the C-terminal of nerve growth factor receptor
(NGFR) (p75).
7D. Amino acid sequence of human colorectal mutant
cancer protein (Sequence I.D. No.: 26).
7E. Amino acid sequence of protein kinase C, alpha type.
7F. Amino acid sequence of serotonin 2A receptor
(Sequence I.D. No.: 27).
7G. Amino acid sequence of serotonin 2B receptor
(Sequence I.D. No.: 28).
7H. Amino acid sequence of adenomatosis polyposis coli
protein (Sequence I.D. No.: 29).

Figure 8. Representation of the structural
characteristics of p75 NGFR (low-affinity nerve growth
factor receptor).

Figure 9. Comparison of the C-terminal ends of Fas and
p75 NGFR.

Figure 10. In vitro interaction of 35S-labeled FAP-1 with
various receptors expressed as GST fusion proteins. The
indicated GST fusion proteins immobilized on glutathione-
Sepharose beads were incubated with in vitro translated,
35S-labeled FAP-1 protein. After the beads were washed,
retained FAP-1 protein was analyzed by SDS-PAGE and
autoradiography.



Figures 11A and 11B. In vitro interaction of 35S-labeled
FAP-1 with GST-p75 deletion mutants.

11A. Schematic representation of the GST fusion
proteins containing the cytoplasmic domains of
p75 and p75 deletion mutants. Binding of FAP-1
to the GST fusion proteins with various p75
deletion mutants is depicted at the right and
is based on data from (11B).
11B. Interaction of in vitro translated, 35S-labeled
FAP-1 protein with various GST fusion proteins
immobilized on glutathione-Sepharose beads.
After the beads were washed, retained FAP-1
protein was analyzed by SDS-PAGE and
autoradiography.



Figure 12. The association between LexA-C-terminal
cytoplasmic region of p75NGFR and VP16-FAP-1. The
indicated yeast strains were constructed by
transformation and the growth of colonies was tested.
+/- indicates the growth of colonies on his- plate.

DETAILED DESCRIPTION OF THE INVENTION

As used herein, amino acid residues are abbreviated as
follows: A, Ala; C, Cys; D, Asp; E, Glu; F, Phe; G, Gly;
H, His; I, Ile; K, Lys; L, Leu; M, Met; N, Asn; P, Pro;
Q, Gln; R, Arg; S, Ser; T, Thr; V, Val; W, Trp; and Y,
Tyr.

In order to facilitate an understanding of the material
which follows, certain frequently occurring methods
and/or terms are best described in Sambrook, et al.,
1989.

The present invention provides for a composition capable
of inhibiting specific binding between a signal-
transducing protein and a cytoplasmic protein containing
the amino acid sequence (G/S/A/E)-L-G-(F/I/L), wherein
each - represents a peptide bond, each parenthesis
encloses amino acids which are alternatives to one another,
and each slash within such parentheses separates the
alternative amino acids. Further, the cytoplasmic
protein may contain the amino acid sequence (K/R/Q)-Xn-
(G/S/A/E)-L-G-(F/I/L), wherein X represents any amino acid
which is selected from the group comprising the twenty
naturally occurring amino acids and n represents at least
2, but not more than 4. Specifically, in a preferred
embodiment, the cytoplasmic protein contains the amino
acid sequence SLGI.

The amino acid sequence (K/R/Q)-Xn-(G/S/A/E)-L-G-(F/I/L)
is also well-known in the art as "GLGF (PDZ/DHR) amino
acid domain." As used herein, "GLGF (PDZ/DHR) amino acid
domain" means the amino acid sequence (K/R/Q)-Xn-
(G/S/A/E)-L-G-(F/I/L).
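
The motif notation defined above maps directly onto a regular
expression. The following is a minimal illustrative sketch in Python
and is not part of the original disclosure; the names AA20, GLGF_CORE,
GLGF_EXTENDED and find_glgf_motifs, as well as the example sequence in
the usage note, are hypothetical and chosen only for illustration.

```python
import re

# The twenty naturally occurring amino acids, one-letter codes.
AA20 = "ACDEFGHIKLMNPQRSTVWY"

# (G/S/A/E)-L-G-(F/I/L)
GLGF_CORE = re.compile(r"[GSAE]LG[FIL]")

# (K/R/Q)-Xn-(G/S/A/E)-L-G-(F/I/L), with n at least 2 and at most 4
GLGF_EXTENDED = re.compile(rf"[KRQ][{AA20}]{{2,4}}[GSAE]LG[FIL]")


def find_glgf_motifs(sequence, extended=False):
    """Return (start, matched_text) for each non-overlapping motif hit."""
    pattern = GLGF_EXTENDED if extended else GLGF_CORE
    return [(m.start(), m.group()) for m in pattern.finditer(sequence.upper())]
```

For the made-up sequence "MKTPSLGIAA", the core pattern finds SLGI at
position 4, and the extended pattern finds KTPSLGI at position 1
(zero-based positions).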



In a preferred embodiment, the signal-transducing protein
has at its carboxyl terminus the amino acid sequence
(S/T)-X-(V/I/L), wherein each - represents a peptide
bond, each parenthesis encloses amino acids which are
alternatives to one another, each slash within such
parentheses separates the alternative amino acids, and
the X represents any amino acid which is selected from
the group comprising the twenty naturally occurring amino
acids.



The compositions of the subject invention may be, but are not
limited to, antibodies, inorganic compounds, organic
compounds, peptides, peptidomimetic compounds,
polypeptides or proteins, fragments or derivatives which
share some or all properties, e.g. fusion proteins. The
composition may be naturally occurring and obtained by
purification, or may be non-naturally occurring and
obtained by synthesis.

Specifically, the composition may be a peptide containing
the sequence (S/T)-X-(V/I/L)-COOH, wherein each -
represents a peptide bond, each parenthesis encloses
amino acids which are alternatives to one another, each
slash within such parentheses separates the alternative
amino acids, and the X represents any amino acid which is
selected from the group comprising the twenty naturally
occurring amino acids. In preferred embodiments, the
peptide contains one of the following sequences:
DSENSNFRNEIQSLV, RNEIQSLV, NEIQSLV, EIQSLV, IQSLV, QSLV,
SLV, IPPDSEDGNEEQSLV, DSEMYNFRSQLASW, IDLASEFLFLSNSFL,
PPTCSQANSGRISTL, SDSNMNMNELSEV, QNFRTYIVSFV, RETIESTV,
RGFISSLV, TIQSVI, ESLV. A further preferred embodiment
would be an organic compound which has the sequence Ac-
SLV-COOH, wherein the Ac represents an acetyl and each -
represents a peptide bond.
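
As an illustration of the carboxyl-terminal motif (S/T)-X-(V/I/L)-COOH,
the following minimal Python sketch tests whether a peptide written in
one-letter code ends in that motif. It is not part of the original
disclosure; the names C_TERMINAL_MOTIF and has_c_terminal_motif are
hypothetical.

```python
import re

# (S/T)-X-(V/I/L)-COOH: the three residues must be the last three of the
# peptide, so the pattern is anchored at the very end of the string (\Z).
# X is restricted to the twenty naturally occurring amino acids.
C_TERMINAL_MOTIF = re.compile(r"[ST][ACDEFGHIKLMNPQRSTVWY][VIL]\Z")


def has_c_terminal_motif(peptide):
    """Return True if the last three residues match (S/T)-X-(V/I/L)."""
    return C_TERMINAL_MOTIF.search(peptide.upper()) is not None
```

Under this check, SLV and DSENSNFRNEIQSLV satisfy the motif, whereas
SLY, the tripeptide of Figure 4D that differs only in its final
residue, does not.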

An example of the subject invention is provided infra.
Acetylated peptides may be automatically synthesized on
an Advanced ChemTech ACT357 using previously published
procedures by analogy. Wang resin was used for each run
and Nα-Fmoc protection was used for all amino acids,
followed by 20% piperidine/DMF, and coupling was completed
using DIC/HOBt and subsequently HBTU/DIEA. After the last
amino acid was coupled, the growing peptide on the resin
was acetylated with Ac2O/DMF. The acetylated peptide was
purified by HPLC and characterized by FAB-MS and 1H-NMR.

Further, one skilled in the art would know how to
construct derivatives of the above-described synthetic
peptides coupled to non-acetyl groups, such as amines.

This invention also provides for a composition capable of
inhibiting specific binding between a signal-transducing
protein having at its carboxyl terminus the amino acid
sequence (S/T)-X-(V/I/L), wherein each - represents a
peptide bond, each parenthesis encloses amino acids which
are alternatives to one another, each slash within such
parentheses separates the alternative amino acids, and the
X represents any amino acid which is selected from the
group comprising the twenty naturally occurring amino
acids, and a cytoplasmic protein.

The compositions of the subject invention include
antibodies, inorganic compounds, organic compounds,
peptides, peptidomimetic compounds, polypeptides or
proteins, fragments or derivatives which share some or
all properties, e.g. fusion proteins.

This invention also provides a method of identifying a
compound capable of inhibiting specific binding between
a signal-transducing protein and a cytoplasmic protein
containing the amino acid sequence (G/S/A/E)-L-G-(F/I/L),
wherein each - represents a peptide bond, each
parenthesis encloses amino acids which are alternatives
to one another, and each slash within such parentheses
separates the alternative amino acids, which comprises
(a) contacting the cytoplasmic protein bound to the
signal-transducing protein with a plurality of compounds
under conditions permitting binding between a known
compound previously shown to be able to displace the
signal-transducing protein bound to the cytoplasmic
protein and the bound cytoplasmic protein to form a
complex; and (b) detecting the displaced signal-
transducing protein or the complex formed in step (a),
wherein the displacement indicates that the compound is
capable of inhibiting specific binding between the
signal-transducing protein and the cytoplasmic protein.

The inhibition of the specific binding between the
signal-transducing protein and the cytoplasmic protein
may affect the transcription activity of a reporter gene.

Further, in step (b), the displaced cytoplasmic protein
or the complex is detected by comparing the transcription
activity of a reporter gene before and after the
contacting with the compound in step (a), where a change
of the activity indicates that the specific binding
between the signal-transducing protein and the
cytoplasmic protein is inhibited and the signal-
transducing protein is displaced.
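
The before-and-after comparison of reporter activity described for
step (b) can be sketched as follows. This is only an illustrative
Python sketch under stated assumptions, not the method of the
disclosure: the function names reporter_changed and screen, the use of
a fold-change criterion, and the default threshold of 2.0 are all
hypothetical, and the specification itself requires only "a change of
the activity".

```python
def reporter_changed(before, after, min_fold_change=2.0):
    """Return True if reporter activity rose or fell by at least
    min_fold_change between the two readings (assumed criterion)."""
    if before <= 0 or after <= 0:
        raise ValueError("reporter readings must be positive")
    ratio = after / before
    return ratio >= min_fold_change or ratio <= 1.0 / min_fold_change


def screen(readings, min_fold_change=2.0):
    """readings maps a compound name to (before, after) reporter
    activity; returns the sorted names of compounds flagged as changed."""
    return sorted(
        name
        for name, (before, after) in readings.items()
        if reporter_changed(before, after, min_fold_change)
    )
```

A compound flagged by such a comparison would be a candidate inhibitor
to confirm by a direct binding assay.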

As used herein, the "transcription activity of a reporter
gene" means that the expression level of the reporter
gene will be altered from the level observed when the
signal-transducing protein and the cytoplasmic protein
are bound. One can also identify the compound by
detecting other biological functions dependent on the
binding between the signal-transducing protein and the
cytoplasmic protein. Examples of reporter genes are
numerous and well-known in the art, including, but not
limited to, histidine resistant genes, ampicillin
resistant genes, and the β-galactosidase gene.

Further, the cytoplasmic protein may be bound to a solid
support. Also, the compound may be bound to a solid
support and may comprise an antibody, an inorganic compound,
an organic compound, a peptide, a peptidomimetic
compound, a polypeptide or a protein.

An example of the method is provided infra. One can
identify a compound capable of inhibiting specific
binding between the signal-transducing protein and the
cytoplasmic protein using direct methods of detection
such as immuno-precipitation of the cytoplasmic protein
and the compound bound to a detectable marker. Further,
one could use indirect methods of detection that would
detect the increase or decrease in levels of gene
expression. As discussed infra, one could construct
synthetic peptides fused to a LexA DNA binding domain.
These constructs would be transformed into the L40 strain
with an appropriate cell line having an appropriate
reporter gene. One could then detect whether inhibition
had occurred by detecting the levels of expression of the
reporter gene. In order to detect the expression levels
of the reporter gene, one skilled in the art could employ
a variety of well-known methods, e.g. two-hybrid systems
in yeast, mammals or other cells.



Further, the contacting of step (a) may be in vitro, in
vivo, and specifically in an appropriate cell, e.g. yeast
cell or mammalian cell. Examples of mammalian cells
include, but are not limited to, the mouse fibroblast cell
NIH 3T3, CHO cells, HeLa cells, Ltk- cells, Cos cells,
etc.



Other suitable cells include, but are not limited to,
prokaryotic or eukaryotic cells, e.g. bacterial cells
(including gram positive cells), fungal cells, insect
cells, and other animal cells.

Further, the signal-transducing protein may be a cell
surface receptor, signal transducer protein, or a tumor
suppressor protein. Specifically, the cell surface
protein is the Fas receptor and may be expressed in cells
derived from organs including, but not limited to,
thymus, liver, kidney, colon, ovary, breast, testis,
spleen, lung, stomach, prostate, uterus, skin, head, and
neck, or expressed in cells comprising T-cells and
B-cells. In a preferred embodiment, the T-cells are Jurkat
T-cells.

Further, the cell-surface receptor may be a CD4 receptor,
p75 receptor, serotonin 2A receptor, or serotonin 2B
receptor.

Further, the signal transducer protein may be Protein
Kinase C-α-type.

Further, the tumor suppressor protein may be an
adenomatosis polyposis coli tumor suppressor protein or
colorectal mutant cancer protein.

Further, the cytoplasmic protein contains the amino acid
sequence SLGI, specifically Fas-associated phosphatase-1.



This invention also provides a method of identifying a
compound capable of inhibiting specific binding between
a signal-transducing protein having at its carboxyl
terminus the amino acid sequence (S/T)-X-(V/I/L), wherein
each - represents a peptide bond, each parenthesis
encloses amino acids which are alternatives to one another,
each slash within such parentheses separates the
alternative amino acids, and the X represents any amino acid
which is selected from the group comprising the twenty
naturally occurring amino acids, and a cytoplasmic
protein, which comprises (a) contacting the signal-
transducing protein bound to the cytoplasmic protein with
a plurality of compounds under conditions permitting
binding between a known compound previously shown to be
able to displace the cytoplasmic protein bound to the
signal-transducing protein and the bound signal-transducing
protein to form a complex; and (b) detecting the
displaced cytoplasmic protein or the complex of step (a),
wherein the displacement indicates that the compound is
capable of inhibiting specific binding between the
signal-transducing protein and the cytoplasmic protein.

The inhibition of the specific binding between the
signal-transducing protein and the cytoplasmic protein
affects the transcription activity of a reporter gene.
Further, in step (b), the displaced signal-transducing
protein or the complex is detected by comparing the
transcription activity of a reporter gene before and
after the contacting with the compound in step (a), where
a change of the activity indicates that the specific
binding between the signal-transducing protein and the
cytoplasmic protein is inhibited and the cytoplasmic
protein is displaced.

Further, in step (b), the displaced cytoplasmic protein
or the complex is detected by comparing the transcription
activity of a reporter gene before and after the
contacting with the compound in step (a), where a change
of the activity indicates that the specific binding
between the signal-transducing protein and the
cytoplasmic protein is inhibited and the signal-
transducing protein is displaced.

As used herein, the "transcription activity of a reporter
gene" means that the expression level of the reporter
gene will be altered from the level observed when the
signal-transducing protein and the cytoplasmic protein
are bound. One can also identify the compound by
detecting other biological functions dependent on the
binding between the signal-transducing protein and the
cytoplasmic protein. Examples of reporter genes are
numerous and well-known in the art, including, but not
limited to, histidine resistant genes, ampicillin
resistant genes, and the β-galactosidase gene.

Further, the cytoplasmic protein may be bound to a solid
support, or the compound may be bound to a solid support
and may comprise an antibody, an inorganic compound, an
organic compound, a peptide, a peptidomimetic compound, a
polypeptide or a protein.

An example of the method is provided infra. One could
identify a compound capable of inhibiting specific
binding between the signal-transducing protein and the
cytoplasmic protein using direct methods of detection
such as immuno-precipitation of the cytoplasmic protein
and the compound bound with a detectable marker.
Further, one could use indirect methods of detection that
would detect the increase or decrease in levels of gene
expression. As discussed infra, one could construct
synthetic peptides fused to a LexA DNA binding domain.
These constructs would be transformed into the L40 strain
with an appropriate cell line having a reporter gene.
One could then detect whether inhibition had occurred by
detecting the levels of the reporter gene. Different
methods are also well known in the art, such as employing
a yeast two-hybrid system to detect the expression of a
reporter gene.



Further, the contacting of step (a) can be in vitro or in
vivo, specifically in a yeast cell or a mammalian cell.
Examples of mammalian cells include, but are not limited to,
the mouse fibroblast cell NIH 3T3, CHO cells, HeLa cells,
Ltk- cells, Cos cells, etc.

Other suitable cells include, but are not limited to,
prokaryotic or eukaryotic cells, e.g. bacterial cells
(including gram positive cells), fungal cells, insect
cells, and other animal cells.

Further, the signal-transducing protein is a cell surface
receptor, signal transducer protein, or a tumor
suppressor protein. Specifically, the cell surface
protein is the Fas receptor and is expressed in cells
derived from organs comprising thymus, liver, kidney,
colon, ovary, breast, testis, spleen, stomach, prostate,
uterus, skin, head and neck, or expressed in cells
comprising T-cells and B-cells. In a preferred
embodiment, the T-cells are Jurkat T-cells.



Further, the cell-surface receptor may be a CD4 receptor,
p75 receptor, serotonin 2A receptor, or serotonin 2B
receptor.

Further, the signal transducer protein may be Protein
Kinase C-α-type.

Further, the tumor suppressor protein may be an
adenomatosis polyposis coli tumor suppressor protein or
colorectal mutant cancer protein.

Further, the cytoplasmic protein contains the amino acid
sequence SLGI, specifically Fas-associated phosphatase-1.



This invention also provides a method of inhibiting the
proliferation of cancer cells comprising the above-
described composition, specifically, wherein the cancer
cells are derived from organs including, but not limited
to, thymus, liver, kidney, colon, ovary, breast, testis,
spleen, stomach, prostate, uterus, skin, head and neck,
or wherein the cancer cells are derived from cells
comprising T-cells and B-cells.

This invention also provides a method of inhibiting the
proliferation of cancer cells comprising the compound
identified by the above-described method, wherein the
cancer cells are derived from organs including, but not
limited to, thymus, liver, kidney, colon, ovary, breast,
testis, spleen, stomach, prostate, uterus, skin, head and
neck, or wherein the cancer cells are derived from cells
comprising T-cells and B-cells.

The invention also provides a method of treating cancer
in a subject which comprises introducing to the subject's
cancerous cells an amount of the above-described
composition effective to result in apoptosis of the
cells, wherein the cancer cells are derived from organs
including, but not limited to, thymus, liver, kidney,
colon, ovary, breast, testis, spleen, stomach, prostate,
uterus, skin, head and neck, or wherein the cancer cells
are derived from cells comprising T-cells and B-cells.



As used herein, "apoptosis" means programmed cell death of
the cell. The mechanisms and effects of programmed cell
death differ from those of cell lysis. Some observable
effects of apoptosis are DNA fragmentation and
disintegration of the cell into small membrane-bound
fragments called apoptotic bodies.

Means of detecting whether the composition has been
effective to result in apoptosis of the cells are well-
known in the art. One means is by assessing the
morphological change of chromatin using either phase
contrast or fluorescence microscopy.

The invention also provides for a method of inhibiting the
proliferation of virally infected cells comprising the
above-described composition or the compound identified by
the above-described method, wherein the virally infected
cells comprise Hepatitis B virus, Epstein-Barr virus,
influenza virus, papilloma virus, adenovirus, human T-cell
lymphotropic virus type 1, or HIV.



The invention also provides a method of treating a
virally-infected subject which comprises introducing to the
subject's virally-infected cells the above-described
composition effective to result in apoptosis of the cells or
the compound identified by the above-described method of
claim 27 effective to result in apoptosis of the cells,
wherein the virally infected cells comprise the Hepatitis B
virus, Epstein-Barr virus, influenza virus, papilloma virus,
adenovirus, human T-cell lymphotropic virus type 1, or HIV.

Means of detecting whether the composition has been
effective to result in apoptosis of the cells are well-known
in the art. One means is by assessing the morphological
change of chromatin using either phase contrast or
fluorescence microscopy.

This invention also provides for a pharmaceutical
composition comprising the above-described composition in an
effective amount and a pharmaceutically acceptable carrier.

This invention also provides for a pharmaceutical
composition comprising the compound identified by the
above-described method in an effective amount and a
pharmaceutically acceptable carrier.

This invention further provides a composition capable of
specifically binding a signal-transducing protein having at
its carboxyl terminus the amino acid sequence
(S/T)-X-(V/L/I), wherein each - represents a peptide bond,
each parenthesis encloses amino acids which are alternatives
to one another, each slash within such parentheses separates
the alternative amino acids, and the X represents any amino
acid which is selected from the group comprising the twenty
naturally occurring amino acids. The composition may contain
the amino acid sequence (G/S/A/E)-L-G-(F/I/L), wherein each -
represents a peptide bond, each parenthesis encloses amino
acids which are alternatives to one another, and each slash
within such parentheses separates the alternative amino
acids. In a preferred embodiment, the composition contains
the amino acid sequence (K/R/Q)-Xn-(G/S/A/E)-L-G-(F/I/L),
wherein X represents any amino acid which is selected from
the group comprising the twenty naturally occurring amino
acids and n represents at least 2, but not more than 4. In
another preferred embodiment, the composition contains the
amino acid sequence SLGI.
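The motif notation above can be read as a set of simple pattern rules. The following is a minimal sketch, assuming one-letter amino acid codes and Python regular expressions; the function names are hypothetical and chosen only for illustration. It checks a sequence for the carboxyl-terminal (S/T)-X-(V/L/I) motif, for the (G/S/A/E)-L-G-(F/I/L) motif, and for the (K/R/Q)-Xn-(G/S/A/E)-L-G-(F/I/L) motif with n from 2 to 4.

```python
import re

# The twenty naturally occurring amino acids, one-letter codes.
ANY_AA = "ACDEFGHIKLMNPQRSTVWY"

# (S/T)-X-(V/L/I) located at the carboxyl terminus (end of the string).
C_TERMINAL_MOTIF = re.compile(rf"[ST][{ANY_AA}][VLI]$")

# (G/S/A/E)-L-G-(F/I/L) located anywhere in the sequence.
GLGF_MOTIF = re.compile(r"[GSAE]LG[FIL]")

# (K/R/Q)-Xn-(G/S/A/E)-L-G-(F/I/L) with n at least 2 and at most 4.
EXTENDED_GLGF_MOTIF = re.compile(rf"[KRQ][{ANY_AA}]{{2,4}}[GSAE]LG[FIL]")


def has_c_terminal_motif(sequence: str) -> bool:
    """True if the last three residues fit (S/T)-X-(V/L/I)."""
    return bool(C_TERMINAL_MOTIF.search(sequence.upper()))


def has_glgf_motif(sequence: str) -> bool:
    """True if the sequence contains (G/S/A/E)-L-G-(F/I/L)."""
    return bool(GLGF_MOTIF.search(sequence.upper()))


def has_extended_glgf_motif(sequence: str) -> bool:
    """True if the sequence contains (K/R/Q)-Xn-(G/S/A/E)-L-G-(F/I/L), n = 2..4."""
    return bool(EXTENDED_GLGF_MOTIF.search(sequence.upper()))


if __name__ == "__main__":
    # C-terminal 15 residues of Fas in one-letter code (ends in SLV).
    print(has_c_terminal_motif("DSENSNFRNEIQSLV"))
    # SLY does not end in V, L or I.
    print(has_c_terminal_motif("SLY"))
    print(has_glgf_motif("SLGI"))
```

The test peptide "KAASLGI" used for the extended motif is a made-up string, not a sequence taken from this specification.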

This invention further provides a method for identifying
compounds capable of binding to a signal-transducing protein
having at its carboxyl terminus the amino acid sequence
(S/T)-X-(V/L/I), wherein each - represents a peptide bond,
each parenthesis encloses amino acids which are alternatives
to one another, each slash within such parentheses separates
the alternative amino acids, and the X represents any amino
acid which is selected from the group comprising the twenty
naturally occurring amino acids, which comprises (a)
contacting the signal-transducing protein with a plurality
of compounds under conditions permitting binding between a
known compound previously shown to be able to bind to the
signal-transducing protein to form a complex; and (b)
detecting the complex formed in step (a) so as to identify a
compound capable of binding to the signal-transducing
protein. Specifically, the identified compound contains the
amino acid sequence (G/S/A/E)-L-G-(F/I/L). In a further
preferred embodiment, the identified compound contains the
amino acid sequence SLGI.



Further, in the above-described method, the
signal-transducing protein may be bound to a solid support.
Also, the compound may be bound to a solid support, and may
comprise an antibody, an inorganic compound, an organic
compound, a peptide, a peptidomimetic compound, a
polypeptide or a protein.

Further, the signal-transducing protein may be a
cell-surface receptor or a signal transducer. Specifically,
the signal-transducing protein may be the Fas receptor, CD4
receptor, p75 receptor, serotonin 2A receptor, serotonin 2B
receptor, or protein kinase C-α-type.

This invention also provides a method of restoring negative
regulation of apoptosis in a cell comprising the
above-described composition or a compound identified by the
above-described method.

As used herein "restoring negative regulation of apoptosis"
means preventing the cell from proceeding to programmed cell
death.

For example, cells that have functional Fas receptors and
Fas-associated phosphatase 1 do not proceed to programmed
cell death or apoptosis due to the negative regulation of
Fas by the phosphatase. However, if Fas-associated
phosphatase 1 is unable to bind to the carboxyl terminus of
the Fas receptor ((S/T)-X-(V/L/I) region), e.g. because of
mutation or deletion of at least one of the amino acids in
the amino acid sequence (G/S/A/E)-L-G-(F/I/L), the cell will
proceed to apoptosis. By introducing a compound capable of
binding to the carboxyl terminus of the Fas receptor, one
could mimic the effects of a functional phosphatase and thus
restore the negative regulation of apoptosis.



This invention also provides a method of preventing
apoptosis in a cell comprising the above-described
composition or a compound identified by the above-described
method.

This invention also provides a means of treating pathogenic
conditions caused by apoptosis of relevant cells comprising
the above-described composition or the compound identified
by the above-described method.

This invention is illustrated in the Experimental Details 
section which follows. These sections are set forth to 
aid in an understanding of the invention but are not 
intended to, and should not be construed to, limit in any 
way the invention as set forth in the claims which follow 
thereafter. 



FIRST SERIES OF EXPERIMENTS

Experimental Details

Methods and Materials

1. Screening a semi-random and random peptide library. 

To create numerous mutations in a restricted DNA sequence,
PCR mutagenesis with degenerate oligonucleotides was
employed according to a protocol described elsewhere (Hill,
et al. 1987). Based on the homology between human and rat,
two palindromic sequences were designed for construction of
the semi-random library. The two primers used were
5'-CGGAATTCNNNNNNNNNAACA^
NTGAGGATCCTCA-3' (Seq. I.D. No.: 30) and
5'-CGGAATTCGACTCAGAANNNNNNAACTTCAGANN
CTGAGGATCCTCA-3' (Seq. I.D. No.: 31). Briefly, the two
primers (each 200 pmol), purified by HPLC, were annealed at
70°C for 5 minutes and cooled at 23°C for 60 minutes. A
Klenow fragment (5 U) was used for filling in with a dNTP
mix (final concentration, 1 mM per each dNTP) at 23°C for 60
minutes. The reaction was stopped with 1 μl of 0.5 M EDTA
and the DNA was purified with ethanol precipitation. The
resulting double-stranded DNA was digested with EcoRI and
BamHI and re-purified by electrophoresis on non-denaturing
polyacrylamide gels. The double-stranded oligonucleotides
were then ligated into the EcoRI-BamHI sites of the pBTM116
plasmid. The ligation mixtures were electroporated into
E. coli XL1-Blue MRF' (Stratagene) for the plasmid library.
The large scale transformation was carried out as previously
reported. The plasmid library was transformed into
L40-strain cells (MATa, trp1, leu2, his3, ade2,
LYS2::(lexAop)4-HIS3, URA3::(lexAop)8-lacZ) carrying the
plasmid pVP16-31 containing a FAP-1 cDNA (Sato, et al.
1995). Clones that formed on histidine-deficient medium
(His+) were transferred to plates containing 40 μg/ml X-gal
to test for a blue reaction product (β-gal+) in plate and
filter assays. The clones selected by the His+ and β-gal+
assays were subjected to further analysis. The palindromic
oligonucleotide,
5'-CGGAATTC-(NNN)4-15-TGAGGATCCTCA-3' (Seq. I.D. No. 32),
was used for the construction of the random peptide library.
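To give a sense of scale for the (NNN)4-15 random region, the theoretical diversity of such a library can be estimated with simple arithmetic. The sketch below is illustrative only and rests on stated assumptions: each N is an equimolar mix of the four bases, and the standard genetic code applies (61 sense codons out of 64). The function names are hypothetical.

```python
# Assumption: each degenerate position N is an equal mix of A, C, G and T.
BASES = 4
SENSE_CODONS = 61   # standard genetic code: 64 codons minus 3 stop codons
TOTAL_CODONS = 64
AMINO_ACIDS = 20


def nucleotide_variants(codons: int) -> int:
    """Number of distinct DNA sequences for a run of `codons` NNN triplets."""
    return BASES ** (3 * codons)


def peptide_variants(codons: int) -> int:
    """Number of distinct peptides of that length (stop codons ignored)."""
    return AMINO_ACIDS ** codons


def fraction_without_stop(codons: int) -> float:
    """Probability that a random NNN run of this length has no stop codon."""
    return (SENSE_CODONS / TOTAL_CODONS) ** codons


if __name__ == "__main__":
    for n in (4, 15):
        print(n, nucleotide_variants(n), peptide_variants(n),
              round(fraction_without_stop(n), 3))
```

Because the theoretical peptide space grows as a power of 20, a screen of practical size samples only a fraction of the longer random inserts, which is one reason a recurring C-terminal pattern in the selected clones is informative.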

2. Synthesis of peptides

Peptides were automatically synthesized on an Advanced
ChemTech ACT357 by analogy to published procedures
(Schnorrenberg and Gerhardt, 1989). Wang resin (0.2-0.3
mmole scale) was used for each run and Nα-Fmoc protection
was employed for all amino acids. Deprotection was achieved
by treatment with 20% piperidine/DMF and coupling was
completed using DIC/HOBt and subsequent HBTU/DIEA. After the
last amino acid was coupled, the growing peptide on the
resin was acetylated with Ac2O/DMF. The peptide was cleaved
from the resin with concomitant removal of all protecting
groups by treating with TFA. The acetylated peptide was
purified by HPLC and characterized by FAB-MS and 1H-NMR.

3. Inhibition assay of Fas/FAP-1 binding using the
C-terminal 15 amino acids of Fas.

HFAP-10 cDNA (Sato, et al. 1995) subcloned into the
Bluescript vector pSK-II (Stratagene) was in
vitro-translated from an internal methionine codon in the
presence of 35S-L-methionine using a coupled in vitro
transcription/translation system (Promega, TNT lysate) and
T7 RNA polymerase. The resulting 35S-labeled protein was
incubated with GST-Fas fusion proteins that had been
immobilized on GST-Sepharose 4B affinity beads (Pharmacia)
in a buffer containing 150 mM NaCl, 50 mM Tris [pH 8.0],
5 mM DTT, 2 mM EDTA, 0.1% NP-40, 1 mM PMSF, 50 μg/ml
leupeptin, 1 mM benzamidine, and 7 μg/ml pepstatin for 16
hours at 4°C. After washing vigorously 4 times in the same
buffer, associated proteins were recovered with the
glutathione-Sepharose beads by centrifugation, eluted into
boiling Laemmli buffer, and analyzed by SDS-PAGE and
fluorography.

4. Inhibition assay of terminal 15 amino acids of Fas and
inhibitory effect on Fas/FAP-1 binding using diverse
tripeptides.

In vitro-translated [35S]HFAP-1 was purified with a NAP-5
column (Pharmacia) and incubated with 3 μM of GST-fusion
proteins for 16 hours at 4°C. After washing 4 times in the
binding buffer, radioactivity incorporation was determined
in a β counter. The percentage of binding inhibition was
calculated as follows: percent inhibition = [radioactivity
incorporation using GST-Fas (191-335) with peptides -
radioactivity incorporation using GST-Fas (191-320) with
peptides] / [radioactivity incorporation using GST-Fas
(191-335) without peptides - radioactivity incorporation
using GST-Fas (191-320) without peptides]. n=3.
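The calculation above can be sketched in a few lines. This is a minimal illustration, not the authors' analysis code: the function names and the example counts are hypothetical, and scaling by 100 to express the ratio as a percentage is an assumption. Read literally, the ratio compares background-corrected binding with peptide to background-corrected binding without peptide; the second function returns the complement of that value, which is offered only as one possible reading and is labeled as an assumption.

```python
def ratio_as_written(with_335: float, with_320: float,
                     without_335: float, without_320: float) -> float:
    """Evaluate the bracketed ratio exactly as stated in the text, times 100.

    Each argument is a radioactivity count: GST-Fas (191-335) or
    GST-Fas (191-320), measured with or without peptide.
    """
    denominator = without_335 - without_320
    if denominator == 0:
        raise ValueError("no specific binding in the peptide-free control")
    return 100.0 * (with_335 - with_320) / denominator


def complement_of_ratio(with_335: float, with_320: float,
                        without_335: float, without_320: float) -> float:
    """Assumption: 100 minus the ratio above, i.e. the lost fraction of binding."""
    return 100.0 - ratio_as_written(with_335, with_320,
                                    without_335, without_320)


if __name__ == "__main__":
    # Hypothetical counts, chosen only to show the arithmetic.
    print(ratio_as_written(600.0, 100.0, 1100.0, 100.0))
    print(complement_of_ratio(600.0, 100.0, 1100.0, 100.0))
```

Subtracting the GST-Fas (191-320) signal in both numerator and denominator removes binding that does not depend on the C-terminal 15 residues, so the ratio reflects only the binding attributable to that region.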



5. Interaction of the C-terminal 3 amino acids of Fas 
with FAP-1 in yeast and in vitro. 

The bait plasmids, pBTM116 (LexA)-SLV, -PLV, -SLY, and -SLA,
were constructed and transformed into the L40 strain with
pVP16-FAP-1 or -ras. Six independent clones from each
transformant were picked for the analysis of growth on
histidine-deficient medium. GST-Fas, -SLV, and -PLV were
purified with GST-Sepharose 4B affinity beads (Pharmacia).
The methods for in vitro binding are described above.

6. Immuno-precipitation of native Fas with GST-FAP-1 and
inhibition of Fas/FAP-1 binding with Ac-SLV.

GST-fusion proteins with or without FAP-1 were incubated
with cell extracts from Jurkat T-cells expressing Fas. The
bound Fas was detected by Western analysis using anti-Fas
monoclonal antibody (F22120, Transduction Laboratories).
The tripeptides Ac-SLV and Ac-SLY were used for the
inhibition assay of Fas/FAP-1 binding.

7. Microinjection of Ac-SLV into the DLD-1 cell line.

DLD-1 human colon cancer cells were cultured in RPMI 1640
medium containing 10% FCS. For microinjection, cells were
plated on CELLocate (Eppendorf) at 1 x 10^5 cells/2 ml in a
35 mm plastic culture dish and grown for 1 day. Just before
microinjection, the Fas monoclonal antibody CH11 (MBL
International) was added at a concentration of 500 ng/ml.
All microinjection experiments were performed using an
automatic microinjection system (Eppendorf transjector 5246,
micro-manipulator 5171 and Femtotips) (Pantel, et al. 1995).
Synthetic tripeptides were suspended in 0.1% (w/v)
FITC-Dextran (Sigma)/K-PBS at a concentration of 100 mM. The
samples were microinjected into the cytoplasmic region of
DLD-1 cells. Sixteen to 20 hours postinjection, the cells
were washed with PBS and stained with 10 ng/ml Hoechst 33342
in PBS. After incubation at 37°C for 30 minutes, the cells
were photographed and the cells showing condensed chromatin
were counted as apoptotic.

8. Quantitation of apoptosis in microinjected DLD-1 cells.

For each experiment, 25-100 cells were microinjected.
Apoptosis of microinjected cells was determined by assessing
morphological changes of chromatin using phase contrast and
fluorescence microscopy (Wang, et al., 1995; McGahon, et
al., 1995). The data are means +/- S.D. for two or three
independent determinations.
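The quantitation described above amounts to a percentage per experiment followed by a mean and standard deviation across experiments. The following is a minimal sketch using Python's standard library; the function names and the example counts are hypothetical, and the use of the sample standard deviation (n - 1 denominator) is an assumption, since the text does not state which form was used.

```python
from statistics import mean, stdev


def percent_apoptotic(apoptotic_cells: int, injected_cells: int) -> float:
    """Percentage of microinjected cells scored as apoptotic in one experiment."""
    if injected_cells <= 0:
        raise ValueError("injected_cells must be positive")
    return 100.0 * apoptotic_cells / injected_cells


def summarize(percentages: list[float]) -> tuple[float, float]:
    """Mean and sample S.D. over independent determinations (needs at least 2)."""
    return mean(percentages), stdev(percentages)


if __name__ == "__main__":
    # Hypothetical counts for three independent determinations.
    experiments = [(10, 50), (15, 50), (20, 50)]
    values = [percent_apoptotic(a, n) for a, n in experiments]
    avg, sd = summarize(values)
    print(f"{avg:.1f} +/- {sd:.1f} % apoptotic")
```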

Discussion 

In order to identify the minimal peptide stretch in the
C-terminal region of the Fas receptor necessary for FAP-1
binding, an in vitro inhibition assay of Fas/FAP-1 binding
was used with a series of synthetic peptides, as well as
yeast two-hybrid system peptide libraries (Figure 2A).
First, semi-random libraries (based on the homology between
human and rat Fas) (Figures 2B and 2C) of 15 amino acids
fused to a LexA DNA binding domain were constructed and
co-transformed into yeast strain L40 with pVP16-31 (Sato, et
al. 1995) that was originally isolated as FAP-1. After the
selection of 200 His+ colonies from an initial screen of
5.0 x 10^6 (Johnson, et al. 1986) transformants, 100
colonies that were β-galactosidase positive were picked for
further analysis. Sequence analysis of the library plasmids
encoding the C-terminal 15 amino acids revealed that all of
the C-termini were either valine, leucine or isoleucine
residues. Second, a random library of 4-15 amino acids fused
to a LexA DNA binding domain was constructed and screened
according to this strategy (Figure 2D). Surprisingly, all of
the third amino acid residues from the C-termini were
serine, and the results of C-terminal amino acid analyses
were identical to the screening of the semi-random cDNA
libraries. No other significant amino acid sequences were
found in these library screenings, suggesting that the
motifs of the last three amino acids (tS-X-V/L/I) are very
important for the association with the third PDZ domain of
FAP-1 and play a crucial role in protein-protein interaction
as well as for the regulation of Fas-induced apoptosis. To
further confirm whether the last three amino acids are
necessary and sufficient for Fas/FAP-1 binding, plasmids of
the LexA-SLV, -PLV, -PLY, -SLY, and -SLA fusion proteins
were constructed and co-transformed into yeast with
pVP16-FAP-1. The results showed that only LexA-SLV
associated with FAP-1, whereas LexA-PLV, -PLY, -SLY, and
-SLA did not (Figure 4A). In vitro binding studies using
various GST-tripeptide fusions and in vitro-translated FAP-1
were consistent with these results (Figure 4B).






In addition to yeast two-hybrid approaches, an in vitro
inhibition assay of Fas/FAP-1 binding was also used. First,
a synthetic peptide of the C-terminal 15 amino acids was
tested for whether it could inhibit the binding of Fas and
FAP-1 in vitro (Figure 3A). The binding of in
vitro-translated FAP-1 to GST-Fas was dramatically reduced,
in a manner dependent on the concentration of the synthetic
15 amino acids of Fas. In contrast with these results, human
PAMP peptide (Kitamura, et al. 1994) as a negative control
had no effect on Fas/FAP-1 binding activity under the same
biochemical conditions. Second, the effect of truncated
C-terminal synthetic peptides of Fas on Fas/FAP-1 binding in
vitro was examined. As shown in Figure 3B, only the three
C-terminal amino acids (Ac-SLV) were sufficient to obtain
the same level of inhibitory effect on the binding of FAP-1
to Fas as achieved with the 4- to 15-amino-acid synthetic
peptides. Furthermore, Fas/FAP-1 binding was extensively
investigated using the scanned tripeptides to determine the
critical amino acid residues required for inhibition (Figure
3C). The results revealed that the third amino acid residues
from the C-terminus, and the C-terminal amino acids, having
the strongest inhibitory effect were either serine or
threonine; and either valine, leucine, or isoleucine,
respectively. However, there were no differences among the
second amino acid residues from the C-terminus with respect
to their inhibitory effect on Fas/FAP-1 binding. These
results were consistent with those of the yeast two-hybrid
system (Figures 2C and 2D). Therefore, it was concluded that
the C-terminal three amino acids (SLV) are critical
determinants of Fas binding to the third PDZ domain of FAP-1
protein.

To further substantiate that the PDZ domain interacts with
tS/T-X-V/L/I under more native conditions, GST-fused FAP-1
proteins were tested for their ability to interact with Fas
expressed in Jurkat T-cells. The results revealed that the
tripeptide Ac-SLV, but not Ac-SLY, abolished in a
dose-dependent manner the binding activity of FAP-1 to Fas
proteins extracted from Jurkat T-cells (Figures 4C and 4D).
This suggests that the C-terminal amino acids tSLV are the
minimum binding site for FAP-1, and that the amino acids
serine and valine are critical for this physical
association.

To next examine the hypothesis that the physiological
association between the C-terminal three amino acids of Fas
and the third PDZ domain of FAP-1 is necessary for the in
vivo function of FAP-1 as a negative regulator of
Fas-mediated signal transduction, a microinjection
experiment was performed with synthetic tripeptides in a
colon cancer cell line, DLD-1, which expresses both Fas and
FAP-1 and is resistant to Fas-induced apoptosis. The
experiments involved the direct microinjection of the
synthetic tripeptides into the cytoplasmic regions of single
cells and the monitoring of the physiological response to
Fas-induced apoptosis in vivo. The results showed that
microinjection of Ac-SLV into DLD-1 cells dramatically
induced apoptosis in the presence of Fas-monoclonal
antibodies (CH11, 500 ng/ml) (Figures 5A, 5E and Figure 6),
but that microinjection of Ac-SLY and PBS/K did not (Figures
5B, 5F and Figure 6). These results strongly support the
hypothesis that the physical association of FAP-1 with the
C-terminus of Fas is essential for protecting cells from
Fas-induced apoptosis.

In summary, it was found that the C-terminal SLV of Fas
alone is necessary and sufficient for binding to the third
PDZ domain of FAP-1. Secondly, a new consensus motif,
tS/T-X-V/L/I, is proposed for such binding to the PDZ
domain, instead of tS/T-X-V. It is therefore possible that
FAP-1 plays important roles in the modulation of signal
transduction pathways in addition to its physical
interaction with Fas. Thirdly, the targeted induction of
Fas-mediated apoptosis in colon cancer cells by direct
microinjection of the tripeptide Ac-SLV was demonstrated.
Further investigations including the identification of a
substrate(s) of FAP-1 and structure-function analysis will
provide insight into the potential therapeutic applications
of the Fas/FAP-1 interaction in cancer as well as provide a
better understanding of the inhibitory effect of FAP-1 on
Fas-mediated signal transduction.



WO 98/05347 PCT/US97/12677 


SECOND SERIES OF EXPERIMENTS 

FAP-1 was originally identified as a membrane-associated
protein tyrosine phosphatase which binds to the C-terminus
of Fas, and possesses six PDZ domains (also known as DHR
domain or GLGF repeat). The PDZ domain has recently been
shown to be a novel module for specific protein-protein
interaction, and it appears to be important in the assembly
of membrane proteins and also in linking signaling molecules
in a multiprotein complex. In recent comprehensive studies,
it was found that the third PDZ domain of FAP-1 specifically
recognized the sequence motif t(S/T)-X-V and interacts with
the C-terminal three amino acids SLV of Fas (Fig. 9). In
order to investigate the possibility that FAP-1 also
interacts with the C-terminal region of p75NGFR (Fig. 8), an
in vitro binding assay was performed, as well as a yeast
two-hybrid analysis using a series of deletion mutants of
p75NGFR. The results revealed that the C-terminal
cytoplasmic region of p75NGFR, which is highly conserved
among all species, interacts with FAP-1 (Fig. 10).
Furthermore, the C-terminal three amino acids SPV of p75NGFR
were necessary and sufficient for the interaction with the
third PDZ domain of FAP-1 (Fig. 11A and 11B). Since FAP-1
expression was found to be highest in fetal brain, these
findings imply that interaction of FAP-1 with p75NGFR plays
an important role in the signal transduction pathway via
p75NGFR in neuronal cells as well as in the formation of the
initial signal-transducing complex for p75NGFR.

SEQUENCE LISTING

(1) GENERAL INFORMATION:

(i) APPLICANT: Takaaki Sato and Junn Yanagisawa

(ii) TITLE OF INVENTION: COMPOUNDS THAT INHIBIT THE
INTERACTION BETWEEN SIGNAL-TRANSDUCING PROTEINS AND THE GLGF
(PDZ/DHR) DOMAIN AND USES THEREOF

(iii) NUMBER OF SEQUENCES: 33

(iv) CORRESPONDENCE ADDRESS:
(A) ADDRESSEE: Cooper & Dunham LLP
(B) STREET: 1185 Avenue of the Americas
(C) CITY: New York
(D) STATE: New York
(E) COUNTRY: U.S.A.
(F) ZIP: 10036

(v) COMPUTER READABLE FORM:
(A) MEDIUM TYPE: Floppy disk
(B) COMPUTER: IBM PC compatible
(C) OPERATING SYSTEM: PC-DOS/MS-DOS
(D) SOFTWARE: PatentIn Release #1.0, Version #1.30

(vi) CURRENT APPLICATION DATA:
(A) APPLICATION NUMBER: Not Yet Known
(B) FILING DATE: 18-JUL-1997
(C) CLASSIFICATION:

(viii) ATTORNEY/AGENT INFORMATION:
(A) NAME: White, John P
(B) REGISTRATION NUMBER: 28,678
(C) REFERENCE/DOCKET NUMBER: 0575/48962-A-PCT/JPW/JKM

(ix) TELECOMMUNICATION INFORMATION:
(A) TELEPHONE: (212) 278-0400
(B) TELEFAX: (212) 391-0525

(2) INFORMATION FOR SEQ ID NO: 1:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 4 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: peptide

(iii) HYPOTHETICAL: NO

(iv) ANTI-SENSE: NO

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1:

Gly/Ser/Ala/Glu Leu Gly Phe/Ile/Leu
1

(2) INFORMATION FOR SEQ ID NO: 2:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 6 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: peptide

(iii) HYPOTHETICAL: NO

(iv) ANTI-SENSE: NO

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2:

Lys/Arg/Gln Xaa (n) Gly/Ser/Ala/Glu Leu Gly Phe/Ile/Leu
1 5



(2) INFORMATION FOR SEQ ID NO: 3:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 4 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: peptide

(iii) HYPOTHETICAL: NO

(iv) ANTI-SENSE: NO

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3:

Ser Leu Gly Ile
1

(2) INFORMATION FOR SEQ ID NO: 4:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 6 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: peptide

(iii) HYPOTHETICAL: NO

(iv) ANTI-SENSE: NO

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4:

Ser/Thr Xaa Val/Ile/Leu
1



(2) INFORMATION FOR SEQ ID NO: 5:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 15 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: peptide

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5:

Asp Ser Glu Asn Ser Asn Phe Arg Asn Glu Ile Gln Ser Leu Val
1 5 10 15

(2) INFORMATION FOR SEQ ID NO: 6:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 15 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: peptide

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6:

Ser Ile Ser Asn Ser Arg Asn Glu Asn Glu Gly Gln Ser Leu Glu
1 5 10 15



(2) INFORMATION FOR SEQ ID NO: 7:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 15 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: peptide

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7:

Ser Thr Pro Asp Thr Gly Asn Glu Asn Glu Gly Gln Cys Leu Glu
1 5 10 15

(2) INFORMATION FOR SEQ ID NO: 8:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 4 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: peptide

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8:

Glu Ser Leu Val
1

(2) INFORMATION FOR SEQ ID NO: 9:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 6 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: peptide

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9:

Thr Ile Gln Ser Val Ile
1 5

(2) INFORMATION FOR SEQ ID NO: 10:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 8 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: peptide

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10:

Arg Gly Phe Ile Ser Ser Leu Val
1 5

(2) INFORMATION FOR SEQ ID NO: 11:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 8 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: peptide

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11:

Arg Glu Thr Ile Glu Ser Thr Val
1 5

(2) INFORMATION FOR SEQ ID NO: 12:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 11 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: peptide

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12:

Gln Asn Phe Arg Thr Tyr Ile Val Ser Phe Val
1 5 10



(2) INFORMATION FOR SEQ ID NO: 13:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 13 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: peptide

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13:

Ser Asp Ser Asn Met Asn Met Asn Glu Leu Ser Glu Val
1 5 10

(2) INFORMATION FOR SEQ ID NO: 14:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 15 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: peptide

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14:

Pro Pro Thr Cys Ser Gln Ala Asn Ser Gly Arg Ile Ser Thr Leu
1 5 10 15



(2) INFORMATION FOR SEQ ID NO: 15:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 15 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: peptide

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15:

Ile Asp Leu Ala Ser Glu Phe Leu Phe Leu Ser Asn Ser Phe Leu
1 5 10 15

(2) INFORMATION FOR SEQ ID NO: 16:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 15 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: peptide

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16:

Asp Ser Glu Met Tyr Asn Phe Arg Ser Gln Leu Ala Ser Val Val
1 5 10 15

(2) INFORMATION FOR SEQ ID NO: 17:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 15 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: peptide

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17:

Ile Pro Pro Asp Ser Glu Asp Gly Asn Glu Glu Gln Ser Leu Val
1 5 10 15



(2) INFORMATION FOR SEQ ID NO: 18:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 4 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: peptide

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18:

Gln Ser Leu Val
1

(2) INFORMATION FOR SEQ ID NO: 19:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 5 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: peptide

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19:

Ile Gln Ser Leu Val
1 5

(2) INFORMATION FOR SEQ ID NO: 20:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 6 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: peptide

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 20:

Glu Ile Gln Ser Leu Val
1 5

(2) INFORMATION FOR SEQ ID NO: 21:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 7 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: peptide

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 21:

Asn Glu Ile Gln Ser Leu Val
1 5

(2) INFORMATION FOR SEQ ID NO: 22:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 8 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: peptide

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 22:

Arg Asn Glu Ile Gln Ser Leu Val
1 5

(2) INFORMATION FOR SEQ ID NO: 23:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 15 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: peptide

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 23:

Asp Ser Glu Asn Ser Asn Phe Arg Asn Glu Ile Gln Ser Leu Val
1 5 10 15



(2) INFORMATION FOR SEQ ID NO: 24:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 427 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: peptide

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 24:

Met Gly Ala Gly Ala Thr Gly Arg Ala Met Asp Gly Pro Arg Leu Leu
1 5 10 15

Leu Leu Leu Leu Leu Gly Val Ser Leu Gly Gly Ala Lys Glu Ala Cys
20 25 30

Pro Thr Gly Leu Tyr Thr His Ser Gly Glu Cys Cys Lys Ala Cys Asn
35 40 45

Leu Gly Glu Gly Val Ala Gln Pro Cys Gly Ala Asn Gln Thr Val Cys
50 55 60

Glu Pro Cys Leu Asp Ser Val Thr Phe Ser Asp Val Val Ser Ala Thr
65 70 75 80

Glu Pro Cys Lys Pro Cys Thr Glu Cys Val Gly Leu Gln Ser Met Ser
85 90 95

Ala Pro Cys Val Glu Ala Asp Asp Ala Val Cys Arg Cys Ala Tyr Gly
100 105 110

Tyr Tyr Gln Asp Glu Thr Thr Gly Arg Cys Glu Ala Cys Arg Val Cys
115 120 125

Glu Ala Gly Ser Gly Leu Val Phe Ser Cys Gln Asp Lys Gln Asn Thr
130 135 140

Val Cys Glu Glu Cys Pro Asp Gly Thr Tyr Ser Asp Glu Ala Asn His
145 150 155 160

Val Asp Pro Cys Leu Pro Cys Thr Val Cys Glu Asp Thr Glu Arg Gln
165 170 175

Leu Arg Glu Cys Thr Arg Trp Ala Asp Ala Glu Cys Glu Glu Ile Pro
180 185 190

Gly Arg Trp Ile Thr Arg Ser Thr Pro Pro Glu Gly Ser Asp Ser Thr
195 200 205

Ala Pro Ser Thr Gln Glu Pro Glu Ala Pro Pro Glu Gln Asp Leu Ile
210 215 220

Ala Ser Thr Val Ala Gly Val Val Thr Thr Val Met Gly Ser Ser Gln
225 230 235 240

Pro Val Val Thr Arg Gly Thr Thr Asp Asn Leu Ile Pro Val Tyr Cys
245 250 255

Ser Ile Leu Ala Ala Val Val Val Gly Leu Val Ala Tyr Ile Ala Phe
260 265 270

Lys Arg Trp Asn Ser Cys Lys Gln Asn Lys Gly Gly Ala Asn Ser Arg
275 280 285

Pro Val Asn Gln Thr Pro Pro Pro Glu Gly Glu Lys Ile His Ser Asp
290 295 300

Ser Gly Ile Ser Val Asp Ser Gln Ser Leu His Asp Gln Gln Pro His
305 310 315 320

Thr Gln Thr Ala Ser Gly Gln Ala Leu Lys Gly Asp Gly Gly Leu Tyr
325 330 335

Ser Ser Leu Pro Pro Ala Lys Arg Glu Glu Val Glu Lys Leu Leu Asn
340 345 350

Gly Ser Ala Gly Asp Thr Trp Arg His Leu Ala Gly Glu Leu Gly Tyr
355 360 365

Gln Pro Glu His Ile Asp Ser Phe Thr His Glu Ala Cys Pro Val Arg
370 375 380

Ala Leu Leu Ala Ser Trp Ala Thr Gln Asp Ser Ala Thr Leu Asp Ala
385 390 395 400

Leu Leu Ala Ala Leu Arg Arg Ile Gln Arg Ala Asp Leu Val Glu Ser
405 410 415

Leu Cys Ser Glu Ser Thr Ala Thr Ser Pro Val
420 425



(2) INFORMATION FOR SEQ ID NO: 25:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 458 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: peptide

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 25:

Met Asn Arg Gly Val Pro Phe Arg His Leu Leu Leu Val Leu Gln Leu
1 5 10 15

Ala Leu Leu Pro Ala Ala Thr Gln Gly Lys Lys Val Val Leu Gly Lys
20 25 30

Lys Gly Asp Thr Val Glu Leu Thr Cys Thr Ala Ser Gln Lys Lys Ser
35 40 45

Ile Gln Phe His Trp Lys Asn Ser Asn Gln Ile Lys Ile Leu Gly Asn
50 55 60

Gln Gly Ser Phe Leu Thr Lys Gly Pro Ser Lys Leu Asn Asp Arg Ala
65 70 75 80

Asp Ser Arg Arg Ser Leu Trp Asp Gln Gly Asn Phe Pro Leu Ile Ile
85 90 95

Lys Asn Leu Lys Ile Glu Asp Ser Asp Thr Tyr Ile Cys Glu Val Glu
100 105 110

Asp Gln Lys Glu Glu Val Gln Leu Leu Val Phe Gly Leu Thr Ala Asn
115 120 125

Ser Asp Thr His Leu Leu Gln Gly Gln Ser Leu Thr Ile Thr Leu Glu
130 135 140

Ser Pro Pro Gly Ser Ser Pro Ser Val Gln Cys Arg Ser Pro Arg Gly
145 150 155 160

Lys Asn Ile Gln Gly Gly Lys Thr Leu Ser Val Ser Gln Leu Glu Leu
165 170 175

Gln Asp Ser Gly Thr Trp Thr Cys Thr Val Leu Gln Asn Gln Lys Lys
180 185 190

Val Glu Phe Lys Ile Asp Ile Val Val Leu Ala Phe Gln Lys Ala Ser
195 200 205

Ser Ile Val Tyr Lys Lys Glu Gly Glu Gln Val Glu Phe Ser Phe Pro
210 215 220

Leu Ala Phe Thr Val Glu Lys Leu Thr Gly Ser Gly Glu Leu Trp Trp
225 230 235 240

Gln Ala Glu Arg Ala Ser Ser Ser Lys Ser Trp Ile Thr Phe Asp Leu
245 250 255

Lys Asn Lys Glu Val Ser Val Lys Arg Val Thr Gln Asp Pro Lys Leu
260 265 270

Gln Met Gly Lys Lys Leu Pro Leu His Leu Thr Leu Pro Gln Ala Leu
275 280 285

Pro Gln Tyr Ala Gly Ser Gly Asn Leu Thr Leu Ala Leu Glu Ala Lys
290 295 300

Thr Gly Lys Leu His Gln Glu Asn Val Leu Val Val Met Arg Ala Thr
305 310 315 320

Gln Leu Gln Lys Asn Leu Thr Cys Glu Val Trp Gly Pro Thr Ser Pro
325 330 335

Lys Leu Met Leu Ser Leu Lys Leu Glu Asn Lys Glu Ala Lys Val Ser
340 345 350

Lys Arg Glu Lys Ala Val Trp Val Leu Asn Pro Glu Ala Gly Met Trp
355 360 365

Gln Cys Leu Leu Ser Asp Ser Gly Gln Val Leu Leu Glu Ser Asn Ile
370 375 380

Lys Val Leu Pro Thr Trp Ser Thr Pro Val Gln Pro Met Ala Leu Ile
385 390 395 400

Val Leu Gly Gly Val Ala Gly Leu Leu Leu Phe Ile Gly Leu Gly Ile
405 410 415

Phe Phe Cys Val Arg Cys Arg His Arg Arg Arg Gln Ala Glu Arg Met
420 425 430

Ser Gln Ile Lys Arg Leu Leu Ser Glu Lys Lys Glu Cys Gln Cys Pro
435 440 445

His Arg Phe Gln Lys Thr Cys Ser Pro Ile
450 455

(2) INFORMATION FOR SEQ ID NO: 26:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 828 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: peptide

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 26:

Met Asn Ser Gly Val Ala Met Lys Tyr Gly Asn Asp Ser Ser Ala Glu
1 5 10 15

Leu Ser Glu Leu His Ser Ala Ala Leu Ala Ser Leu Lys Gly Asp Ile
20 25 30

Val Glu Leu Asn Lys Arg Leu Gln Gln Thr Glu Arg Glu Asp Leu Leu
35 40 45

Glu Lys Lys Leu Ala Lys Ala Gln Cys Glu Gln Ser His Leu Met Arg
50 55 60

Glu His Glu Asp Val Gln Glu Arg Thr Thr Leu Arg Tyr Glu Glu Arg
65 70 75 80

Ile Thr Glu Leu His Ser Val Ile Ala Glu Leu Asn Lys Lys Ile Asp
85 90 95

Arg Leu Gln Gly Thr Thr Ile Arg Glu Glu Asp Glu Tyr Ser Glu Leu
100 105 110

Arg Ser Glu Leu Ser Gln Ser Gln His Glu Val Asn Glu Asp Ser Arg
115 120 125

Ser Met Asp Gln Asp Gln Thr Ser Val Ser Ile Pro Glu Asn Gln Ser
130 135 140

Thr Met Val Thr Ala Asp Met Asp Asn Cys Ser Asp Ile Asn Ser Glu
145 150 155 160

Leu Gln Arg Val Leu Thr Gly Leu Glu Asn Val Val Cys Gly Arg Lys
165 170 175

Lys Ser Ser Cys Ser Leu Ser Val Ala Glu Val Asp Arg His Ile Glu
180 185 190

Gln Leu Thr Thr Ala Ser Glu His Cys Asp Leu Ala Ile Lys Thr Val
195 200 205

Glu Glu Ile Glu Gly Val Leu Gly Arg Asp Leu Tyr Pro Asn Leu Ala
210 215 220

Glu Glu Arg Ser Arg Trp Glu Lys Glu Leu Ala Gly Leu Arg Glu Glu
225 230 235 240

Asn Glu Ser Leu Thr Ala Met Leu Cys Ser Lys Glu Glu Glu Leu Asn
245 250 255

Arg Thr Lys Ala Thr Met Asn Ala Ile Arg Glu Glu Arg Asp Arg Leu
260 265 270

Arg Arg Arg Val Arg Glu Leu Gln Thr Arg Leu Gln Ser Val Gln Ala
275 280 285

Thr Gly Pro Ser Ser Pro Gly Arg Leu Thr Ser Thr Asn Arg Pro Ile
290 295 300

Asn Pro Ser Thr Gly Glu Leu Ser Thr Ser Ser Ser Ser Asn Asp Ile
305 310 315 320

Pro Ile Ala Lys Ile Ala Glu Arg Val Lys Leu Ser Lys Thr Arg Ser
325 330 335

Glu Ser Ser Ser Ser Asp Arg Pro Val Leu Gly Ser Glu Ile Ser Ser
340 345 350

Ile Gly Val Ser Ser Ser Val Ala Glu His Leu Ala His Ser Leu Gln
355 360 365

Asp Cys Ser Asn Ile Gln Glu Ile Phe Gln Thr Leu Tyr Ser His Gly
370 375 380

Ser Ala Ile Ser Glu Ser Lys Ile Arg Glu Phe Glu Val Glu Thr Glu
385 390 395 400

Arg Leu Asn Ser Arg Ile Glu His Leu Lys Ser Gln Asn Asp Leu Leu
405 410 415

Thr Ile Thr Leu Glu Glu Cys Lys Ser Asn Ala Glu Arg Met Ser Met
420 425 430

Leu Val Gly Lys Tyr Glu Ser Asn Ala Thr Ala Leu Arg Leu Ala Leu
435 440 445

Gln Tyr Ser Glu Gln Cys Ile Glu Ala Tyr Glu Leu Leu Leu Ala Leu
450 455 460

Ala Glu Ser Glu Gln Ser Leu Ile Leu Gly Gln Phe Arg Ala Ala Gly
465 470 475 480

Val Gly Ser Ser Pro Gly Asp Gln Ser Gly Asp Glu Asn Ile Thr Gln
485 490 495

Met Leu Lys Arg Ala His Asp Cys Arg Lys Thr Ala Glu Asn Ala Ala
500 505 510

Lys Ala Leu Leu Met Lys Leu Asp Gly Ser Cys Gly Gly Ala Phe Ala
515 520 525

Val Ala Gly Cys Ser Val Gln Pro Trp Glu Ser Leu Ser Ser Asn Ser
530 535 540

His Thr Ser Thr Thr Ser Ser Thr Ala Ser Ser Cys Asp Thr Glu Phe
545 550 555 560

Thr Lys Glu Asp Glu Gln Arg Leu Lys Asp Tyr Ile Gln Gln Leu Lys
565 570 575

Asn Asp Arg Ala Ala Val Lys Leu Thr Met Leu Glu Leu Glu Ser Ile
580 585 590

His Ile Asp Pro Leu Ser Tyr Asp Val Lys Pro Arg Gly Asp Ser Gln
595 600 605

Arg Leu Asp Leu Glu Asn Ala Val Leu Met Gln Glu Leu Met Ala Met
610 615 620

Lys Glu Glu Met Ala Glu Leu Lys Ala Gln Leu Tyr Leu Leu Glu Lys
625 630 635 640

Glu Lys Lys Ala Leu Glu Leu Lys Leu Ser Thr Arg Glu Ala Gln Glu
645 650 655

Gln Ala Tyr Leu Val His Ile Glu His Leu Lys Ser Glu Val Glu Glu
660 665 670

G m Lys Glu Gln Arg Met Arg Ser Leu Ser Ser Thr Ser Ser Gly Ser
675 680 685

Lys Asp Lys Pro Gly Lys Glu Cys Ala Asp Ala Ala Ser Pro Ala Leu
690 695 700

Ser Leu Ala Glu Leu Arg Thr Thr Cys Ser Glu Asn Glu Leu Ala Ala
705 710 715 720

Glu Phe Thr Asn Ala Ile Arg Arg Glu Lys Lys Leu Lys Ala Arg Val
725 730 735

Gln Glu Leu Val Ser Ala Leu Glu Arg Leu Thr Lys Ser Ser Glu Ile
740 745 750

His Gln Gln Ser Ala Glu Phe Val Asn Asp Leu Lys Arg Ala Asn
755 760 765

Ser Asn Leu Val Ala Ala Tyr Glu Lys Ala Lys Lys Lys His Gln Asn
770 775 780

Lys Leu Lys Lys Leu Glu Ser Gln Met Met Ala Met Val Glu Arg His
785 790 795 800

Glu Thr Gln Val Arg Met Leu Lys Gln Arg Ile Ala Leu Leu Glu Glu
805 810 815

Glu Asn Ser Arg Pro His Thr Asn Glu Thr Ser Leu
820 825



(2) INFORMATION FOR SEQ ID NO: 27:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 672 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: peptide

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 27:

Met Ala Asp Val Phe Pro Gly Asn Asp Ser Thr Ala Ser Gln Asp Val
1 5 10 15

Ala Asn Arg Phe Ala Arg Lys Gly Ala Leu Arg Gln Lys Asn Val His
20 25 30

Glu Val Lys Asp His Lys Phe Ile Ala Arg Phe Phe Lys Gln Pro Thr
35 40 45

Phe Cys Ser His Cys Thr Asp Phe Ile Trp Gly Phe Gly Lys Gly Gly
50 55 60

Phe Gln Cys Gln Val Cys Cys Phe Val Val His Lys Arg Cys His Glu
65 70 75 80

Phe Val Thr Phe Ser Cys Pro Gly Ala Asp Lys Gly Pro Asp Thr Asp
85 90 95

Asp Pro Arg Ser Lys His Lys Phe Lys Ile His Thr Tyr Gly Ser Pro
100 105 110

Thr Phe Cys Asp His Cys Gly Ser Leu Leu Tyr Gly Leu Ile His Gln
115 120 125

Gly Met Lys Cys Asp Thr Cys Asp Met Asn Val His Lys Gln Cys Val
130 135 140

Ile Asn Val Pro Ser Leu Cys Gly Met Asp His Thr Glu Lys Arg Gly
145 150 155 160

Arg Ile Tyr Leu Lys Ala Glu Val Ala Asp Glu Lys Leu His Val Thr
165 170 175

Val Arg Asp Ala Lys Asn Leu Ile Pro Met Asp Pro Asn Gly Leu Ser
180 185 190

Asp Pro Tyr Val Lys Leu Lys Leu Ile Pro Asp Pro Lys Asn Glu Ser
195 200 205

Lys Gln Lys Thr Lys Thr Ile Arg Ser Thr Leu Asn Pro Gln Trp Asn
210 215 220

Glu Ser Phe Thr Phe Lys Leu Lys Pro Ser Asp Lys Asp Arg Arg Leu
225 230 235 240

Ser Val Glu Ile Trp Asp Trp Asp Arg Thr Thr Arg Asn Asp Phe Met
245 250 255

Gly Ser Leu Ser Phe Gly Val Ser Glu Leu Met Lys Met Pro Ala Ser
260 265 270

Gly Trp Tyr Lys Leu Leu Asn Gln Glu Glu Gly Glu Tyr Tyr Asn Val
275 280 285

Pro Ile Pro Glu Gly Asp Glu Glu Gly Asn Met Glu Leu Arg Gln Lys
290 295 300

Phe Glu Lys Ala Lys Leu Gly Pro Ala Gly Asn Lys Val Ile Ser Pro
305 310 315 320

Ser Glu Asp Arg Lys Gln Pro Ser Asn Asn Leu Asp Arg Val Lys Leu
325 330 335

Thr Asp Phe Asn Phe Leu Met Val Leu Gly Lys Gly Ser Phe Gly Lys
340 345 350

Val Met Leu Ala Asp Arg Lys Gly Thr Glu Glu Leu Tyr Ala Ile Lys
355 360 365

Ile Leu Lys Lys Asp Val Val Ile Gln Asp Asp Asp Val Glu Cys Thr
370 375 380

Met Val Glu Lys Arg Val Leu Ala Leu Leu Asp Lys Pro Pro Phe Leu
385 390 395 400

Thr Gln Leu His Ser Cys Phe Gln Thr Val Asp Arg Leu Tyr Phe Val
405 410 415

Met Glu Tyr Val Asn Gly Gly Asp Leu Met Tyr His Ile Gln Gln Val
420 425 430

Gly Lys Phe Lys Glu Pro Gln Ala Val Phe Tyr Ala Ala Glu Ile Ser
435 440 445

Ile Gly Leu Phe Phe Leu His Lys Arg Gly Ile Ile Tyr Arg Asp Leu
450 455 460

Lys Leu Asp Asn Val Met Leu Asp Ser Glu Gly His Ile Lys Ile Ala
465 470 475 480

Asp Phe Gly Met Cys Lys Glu His Met Met Asp Gly Val Thr Thr Arg
485 490 495

Thr Phe Cys Gly Thr Pro Asp Tyr Ile Ala Pro Glu Ile Ile Ala Tyr
500 505 510

Gln Pro Tyr Gly Lys Ser Val Asp Trp Trp Ala Tyr Gly Val Leu Leu
515 520 525

Tyr Glu Met Leu Ala Gly Gln Pro Pro Phe Asp Gly Glu Asp Glu Asp
530 535 540

Glu Leu Phe Gln Ser Ile Met Glu His Asn Val Ser Tyr Pro Lys Ser
545 550 555 560

Leu Ser Lys Glu Ala Val Ser Ile Cys Lys Gly Leu Met Thr Lys His
565 570 575

Pro Ala Lys Arg Leu Gly Cys Gly Pro Glu Gly Glu Arg Asp Val Arg
580 585 590

Glu His Ala Phe Phe Arg Arg Ile Asp Trp Glu Lys Leu Glu Asn Arg
595 600 605

Glu Ile Gln Pro Pro Phe Lys Pro Lys Val Cys Gly Lys Gly Ala Glu
610 615 620

Asn Phe Asp Lys Phe Phe Thr Arg Gly Gln Pro Val Leu Thr Pro Pro
625 630 635 640

Asp Gln Leu Val Ile Ala Asn Ile Asp Gln Ser Asp Phe Glu Gly Phe
645 650 655

Ser Tyr Val Asn Pro Gln Phe Val His Pro Ile Leu Gln Ser Ala Val
660 665 670



(2) INFORMATION FOR SEQ ID NO: 28:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 471 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: peptide

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 28:

Met Asp Ile Leu Cys Glu Glu Asn Thr Ser Leu Ser Ser Thr Thr Asn
1 5 10 15

Ser Leu Met Gln Leu Asn Asp Asp Thr Arg Leu Tyr Ser Asn Asp Phe
20 25 30

Asn Ser Gly Glu Ala Asn Thr Ser Asp Ala Phe Asn Trp Thr Val Asp
35 40 45

Ser Glu Asn Arg Thr Asn Leu Ser Cys Glu Gly Cys Leu Ser Pro Ser
50 55 60

Cys Leu Ser Leu Leu His Leu Gln Glu Lys Asn Trp Ser Ala Leu Leu
65 70 75 80

Thr Ala Val Val Ile Ile Leu Thr Ile Ala Gly Asn Ile Leu Val Ile
85 90 95

Met Ala Val Ser Leu Glu Lys Lys Leu Gln Asn Ala Thr Asn Tyr Phe
100 105 110

Leu Met Ser Leu Ala Ile Ala Asp Met Leu Leu Gly Phe Leu Val Met
115 120 125

Pro Val Ser Met Leu Thr Ile Leu Tyr Gly Tyr Arg Trp Pro Leu Pro
130 135 140

Ser Lys Leu Cys Ala Val Trp Ile Tyr Leu Asp Val Leu Phe Ser Thr
145 150 155 160

Ala Ser Ile Met His Leu Cys Ala Ile Ser Leu Asp Arg Tyr Val Ala
165 170 175

Ile Gln Asn Pro Ile His His Ser Arg Phe Asn Ser Arg Thr Lys Ala
180 185 190

Phe Leu Lys Ile Ile Ala Val Trp Thr Ile Ser Val Gly Ile Ser Met
195 200 205

Pro Ile Pro Val Phe Gly Leu Gln Asp Asp Ser Lys Val Phe Lys Glu
210 215 220

Gly Ser Cys Leu Leu Ala Asp Asp Asn Phe Val Leu Ile Gly Ser Phe
225 230 235 240

Val Ser Phe Phe Ile Pro Leu Thr Ile Met Val Ile Thr Tyr Phe Leu
245 250 255

Thr Ile Lys Ser Leu Gln Lys Glu Ala Thr Leu Cys Val Ser Asp Leu
260 265 270

Gly Thr Arg Ala Lys Leu Ala Ser Phe Ser Phe Leu Pro Gln Ser Ser
275 280 285

Leu Ser Ser Glu Lys Leu Phe Gln Arg Ser Ile His Arg Glu Pro Gly
290 295 300

Ser Tyr Thr Gly Arg Arg Thr Met Gln Ser Ile Ser Asn Glu Gln Lys
305 310 315 320

Ala Cys Lys Val Leu Gly Ile Val Phe Phe Leu Phe Val Val Met Trp
325 330 335

Cys Pro Phe Phe Ile Thr Asn Ile Met Ala Val Ile Cys Lys Glu Ser
340 345 350

Cys Asn Glu Asp Val Ile Gly Ala Leu Leu Asn Val Phe Val Trp Ile
355 360 365

Gly Tyr Leu Ser Ser Ala Val Asn Pro Leu Val Tyr Thr Leu Phe Asn
370 375 380

Lys Thr Tyr Arg Ser Ala Phe Ser Arg Tyr Ile Gln Cys Gln Tyr Lys
385 390 395 400

Glu Asn Lys Lys Pro Leu Gln Leu Ile Leu Val Asn Thr Ile Pro Ala
405 410 415

Leu Ala Tyr Lys Ser Ser Gln Leu Gln Met Gly Gln Lys Lys Asn Ser
420 425 430

Lys Gln Asp Ala Lys Thr Thr Asp Asn Asp Cys Ser Met Val Ala Leu
435 440 445

Gly Lys Gln His Ser Glu Glu Ala Ser Lys Asp Asn Ser Asp Gly Val
450 455 460

Asn Glu Lys Val Ser Cys Val
465 470



(2) INFORMATION FOR SEQ ID NO: 29:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 481 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: peptide

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 29:

Met Ala Leu Ser Tyr Arg Val Ser Glu Leu Gln Ser Thr Ile Pro Glu
1 5 10 15

His Ile Leu Gln Ser Thr Phe Val His Val Ile Ser Ser Asn Trp Ser
20 25 30

Gly Leu Gln Thr Glu Ser Ile Pro Glu Glu Met Lys Gln Ile Val Glu
35 40 45

Glu Gln Gly Asn Lys Leu His Trp Ala Ala Leu Leu Ile Leu Met Val
50 55 60

Ile Ile Pro Thr Ile Gly Gly Asn Thr Leu Val Ile Leu Ala Val Ser
65 70 75 80

Leu Glu Lys Lys Leu Gln Tyr Ala Thr Asn Tyr Phe Leu Met Ser Leu
85 90 95

Ala Val Ala Asp Leu Leu Val Gly Leu Phe Val Met Pro Ile Ala Leu
100 105 110

Leu Thr Ile Met Phe Glu Ala Met Trp Pro Leu Pro Leu Val Leu Cys
115 120 125

Pro Ala Trp Leu Phe Leu Asp Val Leu Phe Ser Thr Ala Ser Ile Met
130 135 140

His Leu Cys Ala Ile Ser Val Asp Arg Tyr Ile Ala Ile Lys Lys Pro
145 150 155 160

Ile Gln Ala Asn Gln Tyr Asn Ser Arg Ala Thr Ala Phe Ile Lys Ile
165 170 175

Thr Val Val Trp Leu Ile Ser Ile Gly Ile Ala Ile Pro Val Pro Ile
180 185 190

Lys Gly Ile Glu Thr Asp Val Asp Asn Pro Asn Asn Ile Thr Cys Val
195 200 205

Leu Thr Lys Glu Arg Phe Gly Asp Phe Met Leu Phe Gly Ser Leu Ala
210 215 220

Ala Phe Phe Thr Pro Leu Ala Ile Met Ile Val Thr Tyr Phe Leu Thr
225 230 235 240

Ile His Ala Leu Gln Lys Lys Ala Tyr Leu Val Lys Asn Lys Pro Pro
245 250 255

Gln Arg Leu Thr Trp Leu Thr Val Ser Thr Val Phe Gln Arg Asp Glu
260 265 270

Thr Pro Cys Ser Ser Pro Glu Lys Val Ala Met Leu Asp Gly Ser Arg
275 280 285

Lys Asp Lys Ala Leu Pro Asn Ser Gly Asp Glu Thr Leu Met Arg Arg
290 295 300

Thr Ser Thr Ile Gly Lys Lys Ser Val Gln Thr Ile Ser Asn Glu Gln
305 310 315 320

Arg Ala Ser Lys Val Leu Gly Ile Val Phe Phe Leu Phe Leu Leu Met
325 330 335

Trp Cys Pro Phe Phe Ile Thr Asn Ile Thr Leu Val Leu Cys Asp Ser
340 345 350

Cys Asn Gln Thr Thr Leu Gln Met Leu Leu Glu Ile Phe Val Trp Ile
355 360 365

Gly Tyr Val Ser Ser Gly Val Asn Pro Leu Val Tyr Thr Leu Phe Asn
370 375 380

Lys Thr Phe Arg Asp Ala Phe Gly Arg Tyr Ile Thr Cys Asn Tyr Arg
385 390 395 400

Ala Thr Lys Ser Val Lys Thr Leu Arg Lys Arg Ser Ser Lys Ile Tyr
405 410 415

Phe Arg Asn Pro Met Ala Glu Asn Ser Lys Phe Phe Lys Lys His Gly
420 425 430

Ile Arg Asn Gly Ile Asn Pro Ala Met Tyr Gln Ser Pro Met Arg Leu
435 440 445

Arg Ser Ser Thr Ile Gln Ser Ser Ser Ile Ile Leu Leu Asp Thr Leu
450 455 460

Leu Leu Thr Glu Asn Glu Gly Asp Lys Thr Glu Glu Gln Val Ser Val
465 470 475 480

Val

(2) INFORMATION FOR SEQ ID NO: 30:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 2843 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: peptide

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 30:

Met Ala Ala Ala Ser Tyr Asp Gln Leu Leu Lys Gln Val Glu Ala Leu
1 5 10 15

Lys Met Glu Asn Ser Asn Leu Arg Gln Glu Leu Glu Asp Asn Ser Asn
20 25 30

His Leu Thr Lys Leu Glu Thr Glu Ala Ser Asn Met Lys Glu Val Leu
35 40 45

Lys Gln Leu Gln Gly Ser Ile Glu Asp Glu Ala Met Ala Ser Ser Gly
50 55 60

Gln Ile Asp Leu Leu Glu Arg Leu Lys Glu Leu Asn Leu Asp Ser Ser
65 70 75 80

Asn Phe Pro Gly Val Lys Leu Arg Ser Lys Met Ser Leu Arg Ser Tyr
85 90 95

Gly Ser Arg Glu Gly Ser Val Ser Ser Arg Ser Gly Glu Cys Ser Pro
100 105 110

Val Pro Met Gly Ser Phe Pro Arg Arg Gly Phe Val Asn Gly Ser Arg
115 120 125

Glu Ser Thr Gly Tyr Leu Glu Glu Leu Glu Lys Glu Arg Ser Leu Leu
130 135 140

Leu Ala Asp Leu Asp Lys Glu Glu Lys Glu Lys Asp Trp Tyr Tyr Ala
145 150 155 160

Gln Leu Gln Asn Leu Thr Lys Arg Ile Asp Ser Leu Pro Leu Thr Glu
165 170 175

Asn Phe Ser Leu Gln Thr Asp Met Thr Arg Arg Gln Leu Glu Tyr Glu
180 185 190

Ala Arg Gln Ile Arg Val Ala Met Glu Glu Gln Leu Gly Thr Cys Gln
195 200 205

Asp Met Glu Lys Arg Ala Gln Arg Arg Ile Ala Arg Ile Gln Gln Ile
210 215 220

Glu Lys Asp Ile Leu Arg Ile Arg Gln Leu Leu Gln Ser Gln Ala Thr
225 230 235 240

Glu Ala Glu Arg Ser Ser Gln Asn Lys His Glu Thr Gly Ser His Asp
245 250 255

Ala Glu Arg Gln Asn Glu Gly Gln Gly Val Gly Glu Ile Asn Met Ala
260 265 270

Thr Ser Gly Asn Gly Gln Gly Ser Thr Thr Arg Met Asp His Glu Thr
275 280 285

Ala Ser Val Leu Ser Ser Ser Ser Thr His Ser Ala Pro Arg Arg Leu
290 295 300

Thr Ser His Leu Gly Thr Lys Val Glu Met Val Tyr Ser Leu Leu Ser
305 310 315 320

Met Leu Gly Thr His Asp Lys Asp Asp Met Ser Arg Thr Leu Leu Ala
325 330 335

Met Ser Ser Ser Gln Asp Ser Cys Ile Ser Met Arg Gln Ser Gly Cys
340 345 350

Leu Pro Leu Leu Ile Gln Leu Leu His Gly Asn Asp Lys Asp Ser Val
355 360 365

Leu Leu Gly Asn Ser Arg Gly Ser Lys Glu Ala Arg Ala Arg Ala Ser
370 375 380

Ala Ala Leu His Asn Ile Ile His Ser Gln Pro Asp Asp Lys Arg Gly
385 390 395 400

Arg Arg Glu Ile Arg Val Leu His Leu Leu Glu Gln Ile Arg Ala Tyr
405 410 415

Cys Ser Thr Cys Trp Glu Trp Gln Glu Ala His Glu Pro Gly Met Asp
420 425 430

Gln Asp Lys Asn Pro Met Pro Ala Pro Val Glu His Gln Ile Cys Pro
435 440 445

Ala Val Cys Val Leu Met Lys Leu Ser Phe Asp Glu Glu His Arg His
450 455 460

Ala Met Asn Glu Leu Gly Gly Leu Gln Ala Ile Ala Glu Leu Leu Gln
465 470 475 480

Val Asp Cys Glu Met Tyr Gly Leu Thr Asn Asp His Tyr Ser Ile Thr
485 490 495

Leu Arg Arg Tyr Ala Gly Met Ala Leu Thr Asn Leu Thr Phe Gly Asp
500 505 510

Val Ala Asn Lys Ala Thr Leu Cys Ser Met Lys Gly Cys Met Arg Ala
515 520 525

Leu Val Ala Gln Leu Lys Ser Glu Ser Glu Asp Leu Gln Gln Val Ile
530 535 540

Ala Ser Val Leu Arg Asn Leu Ser Trp Arg Ala Asp Val Asn Ser Lys
545 550 555 560

Lys Thr Leu Arg Glu Val Gly Ser Val Lys Ala Leu Met Glu Cys Ala
565 570 575

Leu Glu Val Lys Lys Glu Ser Thr Leu Lys Ser Val Leu Ser Ala Leu
580 585 590

Trp Asn Leu Ser Ala His Cys Thr Glu Asn Lys Ala Asp Ile Cys Ala
595 600 605

Val Asp Gly Ala Leu Ala Phe Leu Val Gly Thr Leu Thr Tyr Arg Ser
610 615 620

Gln Thr Asn Thr Leu Ala Ile Ile Glu Ser Gly Gly Gly Ile Leu Arg
625 630 635 640

Asn Val Ser Ser Leu Ile Ala Thr Asn Glu Asp His Arg Gln Ile Leu
645 650 655

Ara Glu Asn Asn Cys Leu Gln Thr Leu Leu Gln His Leu Lys Ser His
660 665 670

Ser Leu Thr Ile Val Ser Asn Ala Cys Gly Thr Leu Trp Asn Leu Ser
675 680 685

Ala Arg Asn Pro Lys Asp Gln Glu Ala Leu Trp Asp Met Gly Ala Val
690 695 700

Ser Met Leu Lys Asn Leu Ile His Ser Lys His Lys Met Ile Ala Met
705 710 715 720

Gly Ser Ala Ala Ala Leu Arg Asn Leu Met Ala Asn Arg Pro Ala Lys
725 730 735

Tyr Lys Asp Ala Asn Ile Met Ser Pro Gly Ser Ser Leu Pro Ser Leu
740 745 750

His Val Arg Lys Gln Lys Ala Leu Glu Ala Glu Leu Asp Ala Gln His
755 760 765

Leu Ser Glu Thr Phe Asp Asn Ile Asp Asn Ile Ser Pro Lys Ala Ser
770 775 780

His Arg Ser Lys Gln Arg His Lys Gln Ser Leu Tyr Gly Asp Tyr Val
785 790 795 800

Phe Asp Thr Asn Arg His Asp Asp Asn Arg Ser Asp Asn Phe Asn Thr
805 810 815

Gly Asn Met Thr Val Leu Ser Pro Tyr Leu Asn Thr Thr Val Leu Pro
820 825 830

Ser Ser Ser Ser Ser Arg Gly Ser Leu Asp Ser Ser Arg Ser Glu Lys
835 840 845

Asp Arg Ser Leu Glu Arg Glu Arg Gly Ile Gly Leu Gly Asn Tyr His
850 855 860

Pro Ala Thr Glu Asn Pro Gly Thr Ser Ser Lys Arg Gly Leu Gln Ile
865 870 875 880

Ser Thr Thr Ala Ala Gln Ile Ala Lys Val Met Glu Glu Val Ser Ala
885 890 895

Ile His Thr Ser Gln Glu Asp Arg Ser Ser Gly Ser Thr Thr Glu Leu
900 905 910

His Cys Val Thr Asp Glu Arg Asn Ala Leu Arg Arg Ser Ser Ala Ala
915 920 925

His Thr His Ser Asn Thr Tyr Asn Phe Thr Lys Ser Glu Asn Ser Asn
930 935 940

Arg Thr Cys Ser Met Pro Tyr Ala Lys Leu Glu Tyr Lys Arg Ser Ser
945 950 955 960

Asn Asp Ser Leu Asn Ser Val Ser Ser Ser Asp Gly Tyr Gly Lys Arg
965 970 975

Gly Gln Met Lys Pro Ser Ile Glu Ser Tyr Ser Glu Asp Asp Glu Ser
980 985 990

Lys Phe eg Ser Tyr Gly Gln Tyr Pro Ala Asp Leu Ala His Lys Ile
995 1000 1005

His Ser Ala Asn His Met Asp Asp Asn Asp Gly Gl^Leu Asp Thr Pro
1010 1015 1020

Ile Asn Tyr Ser Leu Lys Tyr Ser Asp Glu Gl^Leu Asn Ser Gly Arg
1025 1030 1035 1040

Gln Ser Pro Ser Gln Asn Glu Arg Trp Ala Arg Pro Lys His Ile Ile
1045 1050 1055

Glu Asp Glu Ile Lys Gln Ser Glu Gln Arg Gln Ser Arg Asn Gln Ser
1060 1065 1070

Thr Thr Tyr Pro Val Tyr Thr Glu Ser Thr Asp Asp Lys His Leu Lys
1075 1080 1085

Phe Gln Pro His Phe Gly Gln Gln Glu Cys Val Ser Pro Tyr Arg Ser
1090 1095 1100

Arg Gly Ala Asn Gly Ser Glu Thr Asn Arg Val Gly Ser Asn His Gly
1105 1110 1115 1120

Ile Asn Gln Asn Val Ser Gln Ser Leu Cys Gln Glu Asp Asp Tyr Glu
1125 1130 1135

Asp Asp Lys Pro Thr Asn Tyr Ser Glu Arg Tyr Ser Glu Glu Glu Gln
1140 1145 1150

His Glu Glu Glu Glu Arg Pro Thr Asn Tyr Ser Ile Lys Tyr Asn Glu
1155 1160 1165

Glu Lys Arg His Val Asp Gln Pro Ile Asp Tyr Ser Ile Leu Lys Ala
1170 1175 1180

Thr Asp Ile Pro Ser Ser Gln Lys Gln Ser Phe Ser Phe Ser Lys Ser
1185 1190 1195 1200

Ser Ser Gly Gln Ser Ser Lys Thr Glu His Met Ser Ser Ser Ser Glu
1205 1210 1215

Asn Thr Ser Thr Pro Ser Ser Asn Ala Lys Arg Gln Asn Gln Leu His
1220 1225 1230

Pro Ser Ser Ala Gln Ser Arg Ser Gly Gln Pro Gln Lys Ala Ala Thr
1235 1240 1245

Cys Lys Val Ser Ser Ile Asn Gln Glu Thr Ile Gln Thr Tyr Cys Val
1250 1255 1260

Glu Asp Thr Pro Ile Cys Phe Ser Arg Cys Ser Ser Leu Ser Ser Leu
1265 1270 1275 1280

Ser Ser Ala Glu Asp Glu Ile Gly Cys Asn Gln Thr Thr Gln Glu Ala
1285 1290 1295

Asp Ser Ala Asn Thr Leu Gln Ile Ala Glu Ile Lys Glu Lys Ile Gly
1300 1305 1310

Thr Arg Ser Ala Glu Asp Pro Val Ser Glu Val Pro Ala Val Ser Gln
1315 1320 1325

His Pro Arg Thr Lys Ser Ser Arg Leu Gln Gly Ser Ser Leu Ser Ser
1330 1335 1340

Glu Ser Ala Arg His Lys Ala Val Glu Phe Ser Ser Gly Ala Lys Ser
1345 1350 1355 1360

Pro Ser Lys Ser Gly Ala Gln Thr Pro Lys Ser Pro Pro Glu His Tyr
1365 1370 1375

Val Gln Glu Thr Pro Leu Met Phe Ser Arg Cys Thr Ser Val Ser Ser
1380 1385 1390

Leu Asp Ser Phe Glu Ser Arg Ser Ile Ala Ser Ser Val Gln Ser Glu
1395 1400 1405

Pro Cys Ser Gly Met Val Ser Gly Ile Ile Ser Pro Ser Asp Leu Pro
1410 1415 1420

Asr> Ser Pro Gly Gln Thr Met Pro Pro Ser Arg Ser Lys Thr Pro Pro
1425 1430 1435 1440

Pro Pro Pro Gln Thr Ala Gln Thr Lys Arg Glu Val Pro Lys Asn Lys
1445 1450 1455

Ala Pro Thr Ala Glu Lys Arg Glu Ser Gly Pro Lys Gln Ala Ala Val
1460 1465 1470

Asn Ala Ala Val Gln Arg Val Gln Val Leu Pro Asp Ala Asp Thr Leu
1475 1480 1485

Leu His Phe Ala Thr Glu Ser Thr Pro Asp Gly Phe Ser Cys Ser Ser
1490 1495 1500

Ser Leu Ser Ala Leu Ser Leu Asp Glu Pro Phe Ile Gln Lys Asp Val
1505 1510 1515 1520

Glu Leu Arg Ile Met Pro Pro Val Gln Glu Asn Asp Asn Gly Asn Glu
1525 1530 1535

Thr Glu Ser Glu Gln Pro Lys Glu Ser Asn Glu Asn Gln Glu Lys Glu
1540 1545 1550

Ala Glu Lys Thr Ile Asp Ser Glu Lys Asp Leu Leu Asp Asp Ser Asp
1555 1560 1565

Asn Asp Asp Ile Glu Ile Leu Glu Glu Cys Ile Ile Ser Ala Met Pro
1570 1575 1580

Thr Lys Ser Ser Arg Lys Ala Lys Lys Pro Ala Gln Thr Ala Ser Lys
1585 1590 1595 1600

Leu Pro Pro Pro Val Ala Arg Lys Pro Ser Gln Leu Pro Val Tyr Lys
1605 1610 1615

Leu Leu Pro Ser Gln Asn Arg Leu Gln Pro Gln Lys His Val Ser Phe
1620 1625 1630

Thr Pro Gly Asp Asp Met Pro Arg Val Tyr Cys Val Glu Gly Thr Pro
1635 1640 1645

Ile Asn Phe Ser Thr Ala Thr Ser Leu Ser Asp Leu Thr Ile Glu Ser
1650 1655 1660

Pro Pro Asn Glu Leu Ala Ala Gly Glu Gly Val Arg Gly Gly Ala Gln
1665 1670 1675 1680

Ser Gly Glu Phe Glu Lys Arg Asp Thr Ile Pro Thr Glu Gly Arg Ser
1685 1690 1695

Thr Asp Glu Ala Gln Gly Gly Lys Thr Ser Ser Val Thr Ile Pro Glu
1700 1705 1710

Leu Asp Asp Asn Lys Ala Glu Glu Gly Asp Ile Leu Ala Glu Cys Ile
1715 1720 1725

Asn Ser Ala Met Pro Lys Gly Lys Ser His Lys Pro Phe Arg Val Lys
1730 1735 1740

Lys Ile Met Asp Gln Val Gln Gln Ala Ser Ala Ser Ser Ser Ala Pro
1745 1750 1755 1760

Asn Lys Asn Gln Leu Asp Gly Lys Lys Lys Lys Pro Thr Ser Pro Val
1765 1770 1775

Lys Pro Ile Pro Gln Asn Thr Glu Tyr Arg Thr Arg Val Arg Lys Asn
1780 1785 1790

Ala Asp Ser Lys Asn Asn Leu Asn Ala Glu Arg Val Phe Ser Asp Asn
1795 1800 1805

Lys Asp Ser Lys Lys Gln Asn Leu Lys Asn Asn Ser Lys Asp Phe Asn
1810 1815 1820

Asp Lys Leu Pro Asn Asn Glu Asp Arg Val Arg Gly Ser Phe Ala Phe
1825 1830 1835 1840

Asp Ser Pro His His Tyr Thr Pro Ile Glu Gly Thr Pro Tyr Cys Phe
1845 1850 1855

Ser Arg Asn Asp Ser Leu Ser Ser Leu Asp Phe Asp Asp Asp Asp Val
1860 1865 1870

Asp Leu Ser Arg Glu Lys Ala Glu Leu Arg Lys Ala Lys Glu Asn Lys
1875 1880 1885

Glu Ser Glu Ala Lys Val Thr Ser His Thr Glu Leu Thr Ser Asn Gln
1890 1895 1900

Gln Ser Ala Asn Lys Thr Gln Ala Ile Ala Lys Gln Pro Ile Asn Arg
1905 1910 1915 1920

Gly Gln Pro Lys Pro Ile Leu Gln Lys Gln Ser Thr Phe Pro Gln Ser
1925 1930 1935

Ser Lys Asp Ile Pro Asp Arg Gly Ala Ala Thr Asp Glu Lys Leu Gln
1940 1945 1950

Asn Phe Ala Ile Glu Asn Thr Pro Val Cys Phe Ser His Asn Ser Ser
1955 1960 1965

Leu Ser Ser Leu Ser Asp Ile Asp Gln Glu Asn Asn Asn Lys Glu Asn
1970 1975 1980

Glu Pro Ile Lys Glu Thr Glu Pro Pro Asp Ser Gln Gly Glu Pro Ser
1985 1990 1995 2000

Lys Pro Gln Ala Ser Gly Tyr Ala Pro Lys Ser Phe His Val Glu Asp
2005 2010 2015

Thr Pro Val Cys Phe Ser Arg Asn Ser Ser Leu Ser Ser Leu Ser Ile
2020 2025 2030

Asp Ser Glu Asp Asp Leu Leu Gln Glu Cys Ile Ser Ser Ala Met Pro
2035 2040 2045

Lys Lys Lys Lys Pro Ser Arg Leu Lys Gly Asp Asn Glu Lys His Ser
2050 2055 2060

Pro Arg Asn Met Gly Gly Ile Leu Gly Glu Asp Leu Thr Leu Asp Leu
2065 2070 2075 2080

Lys Asp Ile Gln Arg Pro Asp Ser Glu His Gly Leu Ser Pro Asp Ser
2085 2090 2095

Glu Asn Phe Asp Trp Lys Ala Ile Gln Glu Gly Ala Asn Ser Ile Val
2100 2105 2110

Ser Ser Leu His Gln Ala Ala Ala Ala Ala Cys Leu Ser Arg Gln Ala
2115 2120 2125

Ser Ser Asp Ser Asp Ser Ile Leu Ser Leu Lys Ser Gly Ile Ser Leu
2130 2135 2140

Gly Ser Pro Phe His Leu Thr Pro Asp Gln Glu Glu Lys Pro Phe Thr
2145 2150 2155 2160

Ser Asn Lys Gly Pro Arg Ile Leu Lys ^Gly Glu Lys Ser Thr Leu
2165 2170 2175

Glu Thr Lys Lys Ile Glu Ser Glu Ser Lys Gly Ile Lys Gly Gly Lys
2180 2185 2190

Lys Val Tyr Lys Ser Leu Ile Thr Gly Lys Val Arg Ser Asn Ser Glu
2195 2200 2205

Ile Ser Gly Gln Met Lys Gln Pro Leu Gln Ala Asn Met Pro Ser Ile
2210 2215 2220

Ser Arg Gly Arg Thr Met Ile His Ile Pro Gly Val Arg Asn Ser Ser
2225 2230 2235 2240



Ser Ser Thr Ser ^° 5 Val Ser L ^ s *V °H Q ^° Pr ° ^ ^ 2f 5**° 
Ala Ser Lys Ser Pro Ser Glu Gly O^Vbx ^ a Thr Thr ^ 0 Pro 



2260 



Gly Ala Lys Pro Ser Val Lys Ser Glu Leu Ser Pro Val Ala Arg Gln
2275 2280 2285

Thr Ser Gln Ile Gly Gly Ser Ser Lys Ala Pro Ser Arg Ser Gly Ser
2290 2295 2300

Arg Asp Ser Thr Pro Ser Arg Pro Ala Gln Gl^Pro Leu Ser Arg Pro
2305 2310 2315 2320

Ile Gln Ser Pro Gly Arg Asn Ser Ile Ser Pro Gly Arg Asn Gly Ile
2325 2330 2335

Ser Pro Pro Asn Lys Ile Ser Gln Leu Pro Arg Thr Ser Ser Pro Ser
2340 2345 2350

Thr Ala Ser Thr Lys Ser Ser Gly Ser Gly Lys Met Ser Tyr Thr Ser
2355 2360 2365

Pro Gl^Arg Gln Met Ser Glr^Gln Asn Leu Thr ^Gln Thr Gly Leu
2370 2375 2380

Ser Lys Asn Ala Ser Ser Ile Pro Arg Ser Glu Ser Ala Ser Lys Gly
2385 2390 2395 2400

Leu Asn Gln Met Asn Asn Gly Asn Gly Ala Asn Lys Lys Val Glu Leu
2405 2410 2415

Ser Arg Met Ser Ser Thr Lys Ser Ser Gly Ser Glu Ser Asp Arg Ser
2420 2425 2430

Glu Arg Pro Val Leu Val Arg Gln Ser Thr Phe Ile Lys Glu Ala Pro
2435 2440 2445

Ser Pro Thr Leu Arg Arg Lys Leu Glu Glu Ser Ala Ser Phe Glu Ser
2450 2455 2460

Leu Ser Pro Ser Ser Arg Pro Ala Ser Pro Thr Arg Ser Gln Ala Gln
2465 2470 2475 2480

Thr Pro Val Leu Ser Pro Ser Leu Pro Asp Met Ser Leu Ser Thr His
2485 2490 2495

Ser Ser Val Gln Ala Gly Gly Trp Arg Lys Leu Pro Pro Asn Leu Ser
2500 2505 2510

Pro Thr Ile Glu Tyr Asn Asp Gly Arg Pro Ala Lys Arg His Asp Ile
2515 2520 2525

Ala Arg Ser His Ser Glu Ser Pro Ser Arg Leu Pro Ile Asn Arg Ser
2530 2535 2540

Gly Thr Trp Lys Arg Glu His Ser Lys His Ser Ser Ser Leu Pro Arg
2545 2550 2555 2560

Val Ser Thr Trp Arg Arg Thr Gly Ser Ser Ser Ser Ile Leu Ser Ala
2565 2570 2575

Ser Ser Glu Ser Ser Glu Lys Ala Lys Ser Glu Asp Glu Lys His Val
2580 2585 2590

Asn Ser Ile Ser Gly Thr Lys Gln Ser Lys Glu Asn Gln Val Ser Ala
2595 2600 2605

Lys Gly Thr Trp Arg Lys Ile Lys Glu Asn Glu Phe Ser Pro Thr Asn
2610 2615 2620

Ser Thr Ser Gln Thr Val Ser Ser Gly Ala Thr Asn Gly Ala Glu Ser
2625 2630 2635 2640

Lys Thr Leu Ile Tyr Gln Met Ala Pro Ala Val Ser Lys Thr Glu Asp
2645 2650 2655

Val Trp Val Arg Ile Glu Asp Cys Pro Ile Asn Asn Pro Arg Ser Gly
2660 2665 2670

Arg Ser Pro Thr Gly Asn Thr Pro Pro Val Ile Asp Ser Val Ser Glu
2675 2680 2685

Lys Ala Asn Pro Asn Ile Lys Asp Ser Lys Asp Asn Gln Ala Lys Gln
2690 2695 2700

Asn Val Gly Asn Gly Ser Val Pro Met Arg Thr Val Gly Leu Glu Asn
2705 2710 2715 2720

Arg Leu Asn Ser Phe Ile Gln Val Asp Ala Pro Asp Gln Lys Gly Thr
2725 2730 2735

Glu Ile Lys Pro Gly Gln Asn Asn Pro Val Pro Val Ser Glu Thr Asn
2740 2745 2750

Glu Ser Ser Ile Val Glu Arg Thr Pro Phe Ser Ser Ser Ser Ser Ser
2755 2760 2765

Lys His Ser Ser Pro Ser Gly Thr Val Ala Ala Arg Val Thr Pro Phe
2770 2775 2780

Asn Tyr Asn Pro Ser Pro Arg Lys Ser Ser Ala Asp Ser Thr Ser Ala
2785 2790 2795 2800

Arg Pro Ser Gln Ile Pro Thr Pro Val Asn Asn Asn Thr Lys Lys Arg
2805 2810 2815

Asp Ser Lys Thr Asp Ser Thr Glu Ser Ser Gly Thr Gln Ser Pro Lys
2820 2825 2830

Arg His Ser Gly Ser Tyr Leu Val Thr Ser Val
2835 2840



PCT/US97/12677 

WO 98/05347 


(2) INFORMATION FOR SEQ ID NO: 31:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 65 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: other nucleic acid
(iii) HYPOTHETICAL: NO
(iv) ANTI-SENSE: NO
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 31:

CGGAATTCNN NNNNNNNAAC AGCNNNNNNN NNAATGAANN NCAAAGTCTG NNNTGAGGAT 60

CCTCA 65

(2) INFORMATION FOR SEQ ID NO: 32:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 65 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: other nucleic acid
(iv) ANTI-SENSE: NO

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 32:

CGGAATTCGA CTCAGAANNN NNNAACTTCA GANNNNNNAT CNNNNNNNNN GTCTGAGGAT 60

CCTCA 65

(2) INFORMATION FOR SEQ ID NO: 33:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 65 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: other nucleic acid
(iv) ANTI-SENSE: NO

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 33:

CGGAATTCNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNTGAGGAT 60

CCTCA 65
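
For readers who want to check the oligonucleotide entries mechanically, the following is a minimal Python sketch and not part of the specification. It assumes the three sequences are transcribed as they appear in SEQ ID NOS: 31-33, treats whitespace as layout only, and reports the length and the number of degenerate (N) positions of each. The names `OLIGOS` and `summarize_oligo` are hypothetical.

```python
# Sequences transcribed from SEQ ID NOS: 31-33; the spaces are layout only.
OLIGOS = {
    "SEQ ID NO: 31": "CGGAATTCNN NNNNNNNAAC AGCNNNNNNN NNAATGAANN NCAAAGTCTG NNNTGAGGAT CCTCA",
    "SEQ ID NO: 32": "CGGAATTCGA CTCAGAANNN NNNAACTTCA GANNNNNNAT CNNNNNNNNN GTCTGAGGAT CCTCA",
    "SEQ ID NO: 33": "CGGAATTCNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNTGAGGAT CCTCA",
}


def summarize_oligo(listing: str) -> dict:
    """Strip layout whitespace and report length and degenerate (N) positions."""
    seq = "".join(listing.split()).upper()
    unexpected = set(seq) - set("ACGTN")
    if unexpected:
        raise ValueError(f"unexpected characters: {sorted(unexpected)}")
    return {"sequence": seq, "length": len(seq), "n_count": seq.count("N")}


if __name__ == "__main__":
    for name, listing in OLIGOS.items():
        info = summarize_oligo(listing)
        print(name, "length:", info["length"], "N positions:", info["n_count"])
```

A length other than the 65 base pairs stated under SEQUENCE CHARACTERISTICS would point to a transcription error in the listing.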
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What is claimed is: 

1. A composition capable of inhibiting specific binding
between a signal-transducing protein and a
cytoplasmic protein containing the amino acid
sequence (G/S/A/E)-L-G-(F/I/L), wherein each -
represents a peptide bond, each parenthesis encloses
amino acids which are alternatives to one another, and
each slash within such parentheses separates the
alternative amino acids.
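
By way of illustration only, the motif notation (G/S/A/E)-L-G-(F/I/L) can be read as a pattern over one-letter amino acid codes. The sketch below is an assumption about how one might scan a sequence for that pattern; it is not a method described in the specification. The function name `find_glgf_motifs` and the test strings are hypothetical.

```python
import re

# (G/S/A/E)-L-G-(F/I/L): one residue from each parenthesized set, in order.
# A lookahead is used so that overlapping occurrences are all reported.
_GLGF_LOOKAHEAD = re.compile(r"(?=([GSAE]LG[FIL]))")


def find_glgf_motifs(sequence: str) -> list[tuple[int, str]]:
    """Return (1-based start position, matched tetrapeptide) for each occurrence."""
    seq = sequence.upper()
    return [(m.start() + 1, m.group(1)) for m in _GLGF_LOOKAHEAD.finditer(seq)]


if __name__ == "__main__":
    # Hypothetical fragment containing the tetrapeptide SLGI.
    print(find_glgf_motifs("AASLGIKK"))
```

The lookahead form matters for inputs such as `GLGLGF`, where two motif instances overlap.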

2. The composition of claim 1, wherein the cytoplasmic
protein contains the amino acid sequence (K/R/Q)-Xn-
(G/S/A/E)-L-G-(F/I/L), wherein X represents any
amino acid which is selected from the group
comprising the twenty naturally occurring amino
acids and n represents at least 2, but not more than
4.
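
The extended motif recited immediately above adds a K, R or Q residue followed by two to four arbitrary residues ahead of the four-residue core. The following Python sketch shows one possible reading of that notation as a regular expression. The helper name `has_extended_motif` and the example strings are hypothetical and serve only to illustrate the bounds on n.

```python
import re

# The twenty naturally occurring amino acids, one-letter codes.
AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"

# (K/R/Q)-Xn-(G/S/A/E)-L-G-(F/I/L) with 2 <= n <= 4.
EXTENDED_MOTIF = re.compile(rf"[KRQ][{AMINO_ACIDS}]{{2,4}}[GSAE]LG[FIL]")


def has_extended_motif(sequence: str) -> bool:
    """True if the sequence contains the extended motif with n between 2 and 4."""
    return EXTENDED_MOTIF.search(sequence.upper()) is not None


if __name__ == "__main__":
    for candidate in ("KAASLGI", "KASLGI", "KAAAAASLGI"):
        print(candidate, has_extended_motif(candidate))
```

In these hypothetical inputs the spacer lengths are 2, 1 and 5 residues, so only the first falls within the stated range.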

3. The composition of claim 1, wherein the cytoplasmic
protein contains the amino acid sequence SLGI.

4. The composition of claim 1, wherein the
signal-transducing protein has at its carboxyl terminus the
amino acid sequence (S/T)-X-(V/I/L), wherein each -
represents a peptide bond, each parenthesis encloses
amino acids which are alternatives to one another,
each slash within such parentheses separating the
alternative amino acids, and the X represents any
amino acid which is selected from the group
comprising the twenty naturally occurring amino
acids.

5. The composition of claim 1, wherein the composition
comprises an antibody, an inorganic compound, an
organic compound, a peptide, a peptidomimetic
compound, a polypeptide, or a protein.

6. The composition of claim 5, wherein the peptide
comprises the sequence (S/T)-X-(V/I/L)-COOH, wherein
each - represents a peptide bond, each parenthesis
encloses amino acids which are alternatives to one
another, each slash within such parentheses separating
the alternative amino acids, and the X represents any
amino acid which is selected from the group
comprising the twenty naturally occurring amino
acids.

7. The composition of claim 6, wherein the peptide has
the amino acid sequence DSENSNFRNEIQSLV.

8. The composition of claim 6, wherein the peptide has
the amino acid sequence RNEIQSLV.

9. The composition of claim 6, wherein the peptide has
the amino acid sequence NEIQSLV.

10. The composition of claim 6, wherein the peptide has
the amino acid sequence EIQSLV.

11. The composition of claim 6, wherein the peptide has
the amino acid sequence IQSLV.

12. The composition of claim 6, wherein the peptide has
the amino acid sequence QSLV.

13. The composition of claim 6, wherein the peptide has
the amino acid sequence SLV.

14. The composition of claim 6, wherein the peptide has
the amino acid sequence IPPDSEDGNEEQSLV.

15. The composition of claim 6, wherein the peptide has
the amino acid sequence DSEMYNFRSQLASW.

16. The composition of claim 6, wherein the peptide has
the amino acid sequence IDLASEFLFLSNSFL.

17. The composition of claim 6, wherein the peptide has
the amino acid sequence PPTCSQANSGRISTL.

18. The composition of claim 6, wherein the peptide has
the amino acid sequence SDSNMNMNELSEV.

19. The composition of claim 6, wherein the peptide has
the amino acid sequence QNFRTYIVSFV.

20. The composition of claim 6, wherein the peptide has
the amino acid sequence RETIESTV.

21. The composition of claim 6, wherein the peptide has
the amino acid sequence RGFISSLV.

22. The composition of claim 6, wherein the peptide has
the amino acid sequence TIQSVI.

23. The composition of claim 6, wherein the peptide has
the amino acid sequence ESLV.
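
The carboxyl-terminal motif (S/T)-X-(V/I/L)-COOH recited above constrains only the last three residues of a peptide. As a hedged illustration, the Python sketch below tests whether a one-letter peptide string ends with that pattern. The function name `ends_with_c_terminal_motif` is hypothetical, the first four example peptides are taken from the claims above, and `QSLA` is an invented negative control.

```python
import re

# The twenty naturally occurring amino acids, one-letter codes.
AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"

# (S/T)-X-(V/I/L) anchored at the carboxyl terminus, i.e. the end of the string.
C_TERMINAL_MOTIF = re.compile(rf"[ST][{AMINO_ACIDS}][VIL]$")


def ends_with_c_terminal_motif(peptide: str) -> bool:
    """True if the last three residues match (S/T)-X-(V/I/L)."""
    return C_TERMINAL_MOTIF.search(peptide.strip().upper()) is not None


if __name__ == "__main__":
    for peptide in ("DSENSNFRNEIQSLV", "SLV", "RETIESTV", "TIQSVI", "QSLA"):
        print(peptide, ends_with_c_terminal_motif(peptide))
```

Anchoring the pattern with `$` reflects the requirement that the motif sit at the free carboxyl end rather than anywhere in the chain.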

24. The composition of claim 6, wherein the organic
compound has the sequence Ac-SLV-COOH, wherein
Ac represents an acetyl group and each - represents a
peptide bond.

25. A composition capable of inhibiting specific binding
between a signal-transducing protein having at its
carboxyl terminus the amino acid sequence (S/T)-X-
(V/I/L), wherein each - represents a peptide bond,
each parenthesis encloses amino acids which are
alternatives to one another, each slash within such
parentheses separating the alternative amino acids,
the X represents any amino acid which is selected
from the group comprising the twenty naturally
occurring amino acids, and a cytoplasmic protein.

26. The composition of claim 25, wherein the composition 
comprises an antibody, an inorganic compound, an 
organic compound, a peptide, a peptidomimetic 
compound, a polypeptide or a protein. 

27. A method of identifying a compound capable of
inhibiting specific binding between a
signal-transducing protein and a cytoplasmic protein
containing the amino acid sequence (G/S/A/E)-L-G-
(F/I/L), wherein each - represents a peptide bond,
each parenthesis encloses amino acids which are
alternatives to one another, each slash within such
parentheses separating the alternative amino acids,
which comprises:

(a) contacting the cytoplasmic protein bound to
the signal-transducing protein with a
plurality of compounds under conditions
permitting binding between a known compound
previously shown to be able to displace the
signal-transducing protein bound to the
cytoplasmic protein and the bound cytoplasmic
protein to form a complex; and
(b) detecting the displaced signal-transducing
protein or the complex formed in step (a),
wherein the displacement indicates that the
compound is capable of inhibiting specific
binding between the signal-transducing protein
and the cytoplasmic protein.

28. The method of claim 27, wherein the inhibition of
specific binding between the signal-transducing
protein and the cytoplasmic protein affects the
transcription activity of a reporter gene.

29. The method of claim 28, wherein in step (b) the
displaced signal-transducing protein or the complex
is detected by comparing the transcription activity
of a reporter gene before and after the contacting
with the compound in step (a), wherein a change of the
activity indicates that the specific binding between
the signal-transducing protein and the cytoplasmic
protein is inhibited and the signal-transducing
protein is displaced.

30. The method of claim 27, wherein the cytoplasmic
protein is bound to a solid support.

31. The method of claim 27, wherein the compound is
bound to a solid support.

32. The method of claim 27, wherein the compound
comprises an antibody, an inorganic compound, an
organic compound, a peptide, a peptidomimetic
compound, a polypeptide or a protein.

33. The method of claim 27, wherein the contacting of
step (a) is in vitro.

34. The method of claim 27, wherein the contacting of
step (a) is in vivo.

35. The method of claim 34, wherein the contacting of
step (a) is in a yeast cell.

36. The method of claim 34, wherein the contacting of
step (a) is in a mammalian cell.

37. The method of claim 27, wherein the
signal-transducing protein is a cell surface receptor.

38. The method of claim 27, wherein the
signal-transducing protein is a signal transducer protein.

39. The method of claim 27, wherein the
signal-transducing protein is a tumor suppressor protein.

40. The method of claim 37, wherein the cell surface 
protein is the Fas receptor. 

41. The method of claim 40, wherein the Fas receptor is 
expressed in cells derived from organs comprising 
the thymus, liver, kidney, colon, ovary, breast, 
testis, spleen, stomach, prostate, uterus, skin, 
head and neck. 

42. The method of claim 40, wherein the Fas receptor is 
expressed in cells comprising T-cells and B-cells. 

43. The method of claim 37, wherein the cell-surface 
receptor is the CD4 receptor. 

44. The method of claim 37, wherein the cell-surface
receptor is the p75 receptor.

45. The method of claim 37, wherein the cell-surface
receptor is the serotonin 2A receptor.

46. The method of claim 37, wherein the cell-surface
receptor is the serotonin 2B receptor.

47. The method of claim 38, wherein the signal
transducer protein is Protein Kinase-C-α-type.

48. The method of claim 39, wherein the tumor suppressor
protein is adenomatosis polyposis coli tumor
suppressor protein.

49. The method of claim 39, wherein the tumor suppressor
protein is the colorectal mutant cancer
protein.

50. The method of claim 27, wherein the cytoplasmic
protein contains the amino acid sequence SLGI,
wherein each - represents a peptide bond, each
parenthesis encloses amino acids which are
alternatives to one another, and each slash within
such parentheses separates the alternative amino
acids.

51. The method of claim 40, wherein the cytoplasmic
protein is Fas-associated phosphatase-1.

52. A method of identifying a compound capable of
inhibiting specific binding between a
signal-transducing protein having at its carboxyl terminus
the amino acid sequence (S/T)-X-(V/I/L), wherein
each - represents a peptide bond, each parenthesis
encloses amino acids which are alternatives to one
another, each slash within such parentheses separating
the alternative amino acids, the X represents any
amino acid which is selected from the group
comprising the twenty naturally occurring amino
acids, and a cytoplasmic protein, which comprises:

(a) contacting the signal-transducing protein
bound to the cytoplasmic protein with a
plurality of compounds under conditions
permitting binding between a known compound
previously shown to be able to displace the
cytoplasmic protein bound to the
signal-transducing protein and the bound
signal-transducing protein to form a complex; and
(b) detecting the displaced cytoplasmic protein or
the complex of step (a), wherein the
displacement indicates that the compound is
capable of inhibiting specific binding between
the signal-transducing protein and the
cytoplasmic protein.

53. The method of claim 52, wherein the inhibition of
specific binding between the signal-transducing
protein and the cytoplasmic protein affects the
transcription activity of a reporter gene.

54. The method of claim 53, wherein in step (b) the
displaced cytoplasmic protein or the complex is
detected by comparing the transcription activity of
a reporter gene before and after the contacting with
the compound in step (a), wherein a change of the
activity indicates that the specific binding between
the signal-transducing protein and the cytoplasmic
protein is inhibited and the cytoplasmic protein is
displaced.

55. The method of claim 52, wherein the cytoplasmic
protein is bound to a solid support.

56. The method of claim 52, wherein the compound is
bound to a solid support.

57. The method of claim 52, wherein the compound
comprises an antibody, an inorganic compound, an
organic compound, a peptide, a peptidomimetic
compound, a polypeptide or a protein.

58. The method of claim 52, wherein the contacting of
step (a) is in vitro.

59. The method of claim 52, wherein the contacting of
step (a) is in vivo.

60. The method of claim 59, wherein the contacting of
step (a) is in a yeast cell.

61. The method of claim 59, wherein the contacting of
step (a) is in a mammalian cell.

62. The method of claim 52, wherein the
signal-transducing protein is a cell surface receptor.

63. The method of claim 52, wherein the
signal-transducing protein is a signal transducer protein.

64. The method of claim 52, wherein the
signal-transducing protein is a tumor suppressor protein.

65. The method of claim 62, wherein the cell surface
protein is the Fas receptor.

66. The method of claim 65, wherein the Fas receptor is
expressed in cells derived from organs comprising
the thymus, liver, kidney, colon, ovary, breast,
testis, spleen, stomach, prostate, uterus, skin,
head and neck.

67. The method of claim 65, wherein the Fas receptor is
expressed in cells comprising T-cells and B-cells.

68. The method of claim 62, wherein the cell-surface
receptor is the CD4 receptor.

69. The method of claim 62, wherein the cell-surface
receptor is the p75 receptor.

70. The method of claim 62, wherein the cell-surface
receptor is the serotonin 2A receptor.

71. The method of claim 62, wherein the cell-surface
receptor is the serotonin 2B receptor.

72. The method of claim 63, wherein the signal
transducer protein is Protein Kinase-C-α-type.

73. The method of claim 64, wherein the tumor suppressor
protein is adenomatosis polyposis coli tumor
suppressor protein.

74. The method of claim 64, wherein the tumor suppressor
protein is the colorectal mutant cancer protein.

75. The method of claim 52, wherein the cytoplasmic
protein contains the amino acid sequence SLGI,
wherein each - represents a peptide bond, each
parenthesis encloses amino acids which are
alternatives to one another, and each slash within
such parentheses separates the alternative amino
acids.

76. The method of claim 52, wherein the cytoplasmic
protein is Fas-associated phosphatase-1.

77. A method of inhibiting the proliferation of cancer
cells comprising the composition of claim 1.

78. The method of claim 77, wherein the cancer cells are
derived from organs comprising the thymus, liver,
kidney, colon, ovary, breast, testis, spleen,
stomach, prostate, uterus, skin, head and neck.

79. The method of claim 77, wherein the cancer cells are
derived from cells comprising T-cells and B-cells.

80. A method of inhibiting the proliferation of cancer
cells comprising the composition of claim 25.

81. The method of claim 80, wherein the cancer cells are 
derived from organs comprising the thymus, liver, 
kidney, colon, ovary, breast, testis, spleen, 
stomach, prostate, uterus, skin, head and neck. 

82. The method of claim 80, wherein the cancer cells are 
derived from cells comprising T-cells and B-cells. 

83. A method of inhibiting the proliferation of cancer
cells comprising the compound identified by the
method of claim 27.

84. The method of claim 83, wherein the cancer cells are
derived from organs comprising the thymus, liver,
kidney, colon, ovary, breast, testis, spleen,
stomach, prostate, uterus, skin, head and neck.

85. The method of claim 83, wherein the cancer cells are
derived from cells comprising T-cells and B-cells.

86. A method of inhibiting the proliferation of cancer
cells comprising the compound identified by the
method of claim 52.

87. The method of claim 86, wherein the cancer cells are
derived from organs comprising the thymus, liver,
kidney, colon, ovary, breast, testis, spleen,
stomach, prostate, uterus, skin, head and neck.

88. The method of claim 86, wherein the cancer cells are
derived from cells comprising T-cells and B-cells.

89. A method of treating cancer in a subject which
comprises introducing to the subject's cancerous
cells an amount of the composition of claim 1
effective to result in apoptosis of the cells.



90. The method of claim 89, wherein the cancer cells are
derived from organs comprising the thymus, liver,
kidney, colon, ovary, breast, testis, spleen,
stomach, prostate, uterus, skin, head and neck.

91. The method of claim 89, wherein the cancer cells are
derived from cells comprising T-cells and B-cells.

92. A method of treating cancer in a subject which
comprises introducing to the subject's cancerous
cells an amount of the composition of claim 25
effective to result in apoptosis of the cells.

93. The method of claim 92, wherein the cancer cells are
derived from organs comprising the thymus, liver,
kidney, colon, ovary, breast, testis, spleen,
stomach, prostate, uterus, skin, head and neck.

94. The method of claim 92, wherein the cancer cells are
derived from cells comprising T-cells and B-cells.



95. A method of treating cancer in a subject which 
comprises introducing to the subject's cancerous 
cells an amount of the compound identified by the 
method of claim 27 effective to allow apoptosis of 
the cells. 

96. The method of claim 95, wherein the cancer cells are
derived from organs comprising the thymus, liver,
kidney, colon, ovary, breast, testis, spleen,
stomach, prostate, uterus, skin, head and neck.

97. The method of claim 95, wherein the cancer cells are
derived from cells comprising T-cells and B-cells.

98. A method of treating cancer in a subject which
comprises introducing to the subject's cancerous
cells an amount of the compound identified by the
method of claim 52 effective to result in apoptosis
of the cells.

99. The method of claim 98, wherein the cancer cells are
derived from organs comprising the thymus, liver,
kidney, colon, ovary, breast, testis, spleen,
stomach, prostate, uterus, skin, head and neck.

100. The method of claim 98, wherein the cancer cells are
derived from cells comprising T-cells and B-cells.

101. A method of inhibiting the proliferation of virally
infected cells comprising the composition of claim
1.

102. A method of inhibiting the proliferation of virally
infected cells comprising the composition of claim
25.

103. A method of inhibiting the proliferation of virally
infected cells comprising the compound identified by
the method of claim 27.

104. A method of inhibiting the proliferation of virally
infected cells comprising the compound identified by
the method of claim 52.

105. The method of claim 101, wherein the virally
infected cells comprise Hepatitis B virus,
Epstein-Barr virus, influenza virus, Papilloma virus, Adeno
virus, Human T-cell lymphotropic virus, type 1 or
HIV.

106. The method of claim 102, wherein the virally
infected cells comprise Hepatitis B virus,
Epstein-Barr virus, influenza virus, Papilloma virus, Adeno
virus, Human T-cell lymphotropic virus, type 1 or
HIV.

107. The method of claim 103, wherein the virally
infected cells comprise Hepatitis B virus,
Epstein-Barr virus, influenza virus, Papilloma virus, Adeno
virus, Human T-cell lymphotropic virus, type 1 or
HIV.

108. The method of claim 104, wherein the virally
infected cells comprise Hepatitis B virus,
Epstein-Barr virus, influenza virus, Papilloma virus, Adeno
virus, Human T-cell lymphotropic virus, type 1 or
HIV.

109. A method of treating a virally-infected subject
which comprises introducing to the subject's
virally-infected cells the composition of claim 1
effective to result in apoptosis of the cells.

110. A method of treating a virally-infected subject
which comprises introducing to the subject's
virally-infected cells the composition of claim 25 effective
to result in apoptosis of the cells.

111. A method of treating a virally-infected subject
which comprises introducing to the subject's
virally-infected cells an amount of the compound
identified by the method of claim 27 effective to
result in apoptosis of the cells.

112. A method of treating a virally-infected subject
which comprises introducing to the subject's
virally-infected cells an amount of the compound
identified by the method of claim 52 effective to
result in apoptosis of the cells.

113. The method of claim 109, wherein the virally
infected cells comprise the Hepatitis B virus,
Epstein-Barr virus, influenza virus, Papilloma
virus, Adeno virus, Human T-cell lymphotropic virus,
type 1 or HIV.

114. The method of claim 110, wherein the virally
infected cells comprise the Hepatitis B virus,
Epstein-Barr virus, influenza virus, Papilloma
virus, Adeno virus, Human T-cell lymphotropic virus,
type 1 or HIV.

115. The method of claim 111, wherein the virally
infected cells comprise the Hepatitis B virus,
Epstein-Barr virus, influenza virus, Papilloma
virus, Adeno virus, Human T-cell lymphotropic virus,
type 1 or HIV.

116. The method of claim 112, wherein the virally
infected cells comprise the Hepatitis B virus,
Epstein-Barr virus, influenza virus, Papilloma
virus, Adeno virus, Human T-cell lymphotropic virus,
type 1 or HIV.

117. A pharmaceutical composition comprising the 
composition of claim 1 in an effective amount and a 
pharmaceutically acceptable carrier. 



30 



118. A pharmaceutical composition comprising the
composition of claim 25 in an effective amount and
a pharmaceutically acceptable carrier.

119. A pharmaceutical composition comprising the compound
identified by the method of claim 27 in an effective
amount and a pharmaceutically acceptable carrier.

120. A pharmaceutical composition comprising the compound
identified by the method of claim 52 in an effective
amount and a pharmaceutically acceptable carrier.

FIG. 3B

FIG. 7C

FIG. 7H

1 maaasydqU kqvealkmen snlrqeledn snhltklete asnmkevlkq Iqgsiedeam 
61 assgqidlle rlkelnldss nfpgvklrsk mslrsygsre gsvssrsgec spvpmgsfpr 
121 rgfSngsres tgyleeleke rsllladldk eekekdwyya qlqnltkrid slpltenfsl 
181 qtdmtrrqle yearqirvam eeqlgtcqdm ekraqrriar iqqxekdxlr irqUqsqat 
241 eaerssqnkh etgshdaerq negqgvgein matsgngqgs ttrmdhetas vlssssthsa 
301 prrltshlgt kvemvyslls mlgthdkddm srtUamsss qdscismrqs gclplUqll 
361 hgndkdsvll gnsrgskear arasaalhni ihsqpddkrg rreirvlhll eqiraycetc 
421 wewqeahepg mdqdknpmpa pvehqicpav cvlmklsfde ehrhamnelg glqaiaellq 
481 vdcemygltn dhysitlrry agmaltnltf gdvankatlc smkgcmralv aqlksesedl 
541 qqviasSlrn Iswradvnsk ktlrevgsvk almecalevk kestlksvls alwnlsahct 
601 enkadicavd galaflvgtl tyrsqtntla iiesgggUr nvssliatne dhrqilrenn 
661 clqtUqhlk shsltivsna cgtlwnlsar npkdqealwd mgavsmlknl ihskhknnam 
721 gsaaalrnlm anrpakykda nimspgsslp slhvrkqkal eaeldaqhls etfdmdnls 
781 pkashrskqr hkqslygdyv fdtnrhddnr sdnfntgnmt vlspylnttv Ipsssssrgs 
841 Idssrsekdr slerergigl gnyhpatenp gtsskrglqi sttaaqiakv meevsaihts 
901 qedrssgstt elhcvtdern alrrssaaht hsntynftks ensnrtcsmp yakleykrss 
961 ndslnsvsss dgygkrgqmk psiesysedd eskfcsygqy padlahkihs anhmddndge 
1021 Idtpinyslk ysdeqlnsgr qspsqnerwa rpkhiiedei kqseqrqsrn qsttypvyte 
1081 stddkhlkfq phfgqqecvs pyrsrgangs etnrvgsnhg xnqnvsqslc qeddyeddkp 
1141 tnyserysee eqheeeerpt nysikyneek rhvdqpidys Ikyatdipss qkqsfsfsks 
1201 ssgqsskteh mssssentst pssnakrqnq Ihpssaqsrs gqpqkaatck vssinqetiq 
1261 tycvedtpic fsrcsslssl ssaedeigcn qttqeadsan tlqiaeikek igtrsaedpv 
1321 sevpavsqhp rtkssrlqgs slssesarhk avefssgaks psksgaqtpk sppehyvqet 
1381 plmfsrctsv ssldsfesrs iassvqsepc sgmvsgiisp sdlpdspgqt JPPsrsktpp 
1441 pppqtaqtkr evpknkapta ekresgpkqa avnaavqrvq vlpdadtllh fatestpdgf 
1501 scssslsals Idepfiqkdv elrimppvqe ndngnetese qpkesnenqe keaektxdse 
1561 kdllddsddd dieileecii samptkssrk akkpaqtask Ipppvarkps qlpvykllps 
1621 qnrlqpqkhv sftpgddmpr vycvegtpin fstatslsdl txesppnela agegvrggaq 
1681 sgef ek?dti ptegfstdea qggktssvti pelddnkaee gdi laecins ampkgkshkp 
1741 frvkkimdqv qqalasssap nknqldgkkk kptspvkpip Pnteyrtrvr knadsknnln 
1801 aervfsdnkd skkqnlknns kdfndklpnn edrvrgsfaf dsphhytpie gtpycfsrnd 
1861 slssldfddd dvdlsrekae Irkakenkes eakvtshtel tsnqqsankt qaiakqpinr 
1921 gqpkpilqkq stfpqsskdi pdrgaatdek Iqnfaientp vcfshnssls slsdidqenn 
1981 nkenepiket eppdsqgeps kpqasgyapk sfhvedtpvc fsrnsslssl sadseddllq 
2041 ectslampkk kkpsrlkgdn ekhsprnmgg ilgedltldl kdiqrpdseh Qlspdsenfd
2101 wkaiqegans ivsslhqaaa aaclsrqass dsdsilslks gislgspfhl tpdqeekpft 
2161 snkqSrilkp gekstletkk ieseskgikg gkkvykslit gkvrsnseis gqmkqplqan 
2221 mpsisrgrtm ihipgvrnss sstspvskkg pplktpasks psegqtatts prgakpsvks 
2281 elspvarqts qiggsskaps rsgsrdstps rpaqqplsrp iqspgrnsis pgrngisppn 
2341 klsqlprtss pstastkssg sgkmsytspg rqmsqqnltk qtglsknass iprsesaskg 
2401 lnqmnngnga nkkvelsrms stkssgsesd rserpvlvrq stfikeapsp tlrrkleesa
2461 sfeslspssr pasptrsqaq tpvlspslpd mslsthssvq aggwrklppn lsptieyndg
2521 rpakrhdiar shsespsrlp inrsgtwkre hskhssslpr vstwrrtgss ssilsasses
2581 sekaksedek hvnsisgtkq skenqvsakg twrkikenef sptnstsqtv ssgatngaes 
2641 ktliyqmapa vsktedvwvr iedcpinnpr sgrsptgntp pvidsvseka npnikdskdn
2701 qakqnvgngs vpmrtvglen rlnsfiqvda pdqkgteikp gqnnpvpvse tnessivert 
2761 pfsssssskh sspsgtvaar vtpfnynpsp rkssadstsa rpsqiptpvn nntkkrdskt 
2821 dstessgtqs pkrhsgsylv tsv

FIG. 10